International Application No
A. CLASSIFICATION OF SUBJECT MATTER

IPC 7 C12N15/82 C12N15/29 A01H5/02



According to International Patent Classification (IPC) or to both national classification and IPC 



B. FIELDS SEARCHED 



Minimum documentation searched (classification system followed by classification symbols) 

IPC 7 C12N 



Documentation searched other than minimum documentation to the extent that such documents are included in the fields searched 



Electronic data base consulted during the international search (name of data base and, where practical, search terms used) 

STRAND, EPO-Internal, WPI Data, PAJ, BIOSIS



C. DOCUMENTS CONSIDERED TO BE RELEVANT 



Category * | Citation of document, with indication, where appropriate, of the relevant passages | Relevant to claim No.

X | WO 98 13503 A (F B INVESTMENTS PTY LTD; TEASDALE ROBERT DIXON (AU)) 2 April 1998 (1998-04-02) | 1, 2, 8-14, 23-25, 31, 32
Y | (same document) | 1, 2, 4-18, 23-29, 31, 32
  | abstract
  | page 1, line 13 - line 19
  | page 3, line 10 - line 25
  | page 4, line 2 - page 5, line 11
  | page 6, line 24 - page 7, line 10
  | page 9, line 6 - line 8
Further documents are listed in the continuation of box C.
* Special categories of cited documents:

"A" document defining the general state of the art which is not considered to be of particular relevance
"E" earlier document but published on or after the international filing date
"L" document which may throw doubts on priority claim(s) or which is cited to establish the publication date of another citation or other special reason (as specified)
"O" document referring to an oral disclosure, use, exhibition or other means
"P" document published prior to the international filing date but later than the priority date claimed

"T" later document published after the international filing date or priority date and not in conflict with the application but cited to understand the principle or theory underlying the invention
"X" document of particular relevance; the claimed invention cannot be considered novel or cannot be considered to involve an inventive step when the document is taken alone
"Y" document of particular relevance; the claimed invention cannot be considered to involve an inventive step when the document is combined with one or more other such documents, such combination being obvious to a person skilled in the art.
"&" document member of the same patent family
Tel. (+31-70) 340-2040, Tx. 31 651 epo nl,
Fax: (+31-70) 340-3016 


Authorized officer 

Ceder, O
8-16,
23-27,
HEIDELBERG DE
the whole document
CELL, US, CELL PRESS, CAMBRIDGE, NA,
vol. 50, 31 July 1987 (1987-07-31), pages
abstract

International application No. PCT/US 99/24407



Box I Observations where certain claims were found unsearchable (Continuation of item 1 of first sheet)

This International Search Report has not been established in respect of certain claims under Article 17(2)(a) for the following reasons:

1. □ Claims Nos.:
because they relate to subject matter not required to be searched by this Authority, namely:

2. □ Claims Nos.:
because they relate to parts of the International Application that do not comply with the prescribed requirements to such an extent that no meaningful International Search can be carried out, specifically:

3. □ Claims Nos.:
because they are dependent claims and are not drafted in accordance with the second and third sentences of Rule 6.4(a).

Box II Observations where unity of invention is lacking (Continuation of item 2 of first sheet) 

This International Searching Authority found multiple inventions in this international application, as follows: 

see additional sheet 



1. □ As all required additional search fees were timely paid by the applicant, this International Search Report covers all searchable claims.

2. □ As all searchable claims could be searched without effort justifying an additional fee, this Authority did not invite payment of any additional fee.

3. ☒ As only some of the required additional search fees were timely paid by the applicant, this International Search Report covers only those claims for which fees were paid, specifically claims Nos.:

1, 2, 4-18, 23-29, 31, 32 (inventions 1-4)

4. □ No required additional search fees were timely paid by the applicant. Consequently, this International Search Report is restricted to the invention first mentioned in the claims; it is covered by claims Nos.:

Remark on Protest
□ The additional search fees were accompanied by the applicant's protest.
☒ No protest accompanied the payment of additional search fees.
FURTHER INFORMATION CONTINUED FROM PCT/ISA/210

This International Searching Authority found multiple (groups of)
inventions in this international application, as follows:

1. Claims: partly: 1, 2, 8-14, 23-25, 31, 32 and completely: 4,
15, 26

An isolated nucleic acid molecule comprising a floral organ
selective regulatory element operatively linked to a
nucleotide sequence encoding a cytotoxic gene product, a
transgenic plant characterised by suppressed flowering
containing it and a method and kit for producing the plant,
where the floral organ selective regulatory element is AGL2
(Seq Id No 1).



2. Claims: partly: 1, 2, 8-14, 23-25, 31, 32 and completely: 5,
16, 27

An isolated nucleic acid molecule comprising a floral organ
selective regulatory element operatively linked to a
nucleotide sequence encoding a cytotoxic gene product, a
transgenic plant characterised by suppressed flowering
containing it and a method and kit for producing the plant,
where the floral organ selective regulatory element is AGL4
(Seq Id No 2).



3. Claims: partly: 1, 2, 8-14, 23-25, 31, 32 and completely: 6,
17, 28

An isolated nucleic acid molecule comprising a floral organ
selective regulatory element operatively linked to a
nucleotide sequence encoding a cytotoxic gene product, a
transgenic plant characterised by suppressed flowering
containing it and a method and kit for producing the plant,
where the floral organ selective regulatory element is AGL9
(Seq Id No 3).



4. Claims: partly: 1, 2, 8-14, 23-25, 31, 32 and completely: 7,
18, 29

An isolated nucleic acid molecule comprising a floral organ
selective regulatory element operatively linked to a
nucleotide sequence encoding a cytotoxic gene product, a
transgenic plant characterised by suppressed flowering
containing it and a method and kit for producing the plant,
where the floral organ selective regulatory element is AP1
(Seq Id No 10).

5. Claims: partly: 1, 3, 8-13, 19, 23, 24, 30, 31, 33

An isolated nucleic acid molecule comprising a floral organ
selective regulatory element operatively linked to a
nucleotide sequence encoding a cytotoxic gene product, a
transgenic plant characterised by suppressed flowering
containing it and a method and kit for producing the plant,
where the cytotoxic gene product is diphtheria toxin A chain.



6. Claims: partly: 1, 3, 8-13, 19, 23, 24, 30, 31, 33

An isolated nucleic acid molecule comprising a floral organ
selective regulatory element operatively linked to a
nucleotide sequence encoding a cytotoxic gene product, a
transgenic plant characterised by suppressed flowering
containing it and a method and kit for producing the plant,
where the cytotoxic gene product is RNase T1.



7. Claims: partly: 1, 3, 8-13, 19, 23, 24, 30, 31, 33

An isolated nucleic acid molecule comprising a floral organ
selective regulatory element operatively linked to a
nucleotide sequence encoding a cytotoxic gene product, a
transgenic plant characterised by suppressed flowering
containing it and a method and kit for producing the plant,
where the cytotoxic gene product is Barnase RNase.



8. Claims: partly: 1, 3, 8-13, 19, 23, 24, 30, 31, 33

An isolated nucleic acid molecule comprising a floral organ
selective regulatory element operatively linked to a
nucleotide sequence encoding a cytotoxic gene product, a
transgenic plant characterised by suppressed flowering
containing it and a method and kit for producing the plant,
where the cytotoxic gene product is ricin toxin A chain.



9. Claims: partly: 1, 3, 8-13, 19, 23, 24, 30, 31, 33

An isolated nucleic acid molecule comprising a floral organ
selective regulatory element operatively linked to a
nucleotide sequence encoding a cytotoxic gene product, a
transgenic plant characterised by suppressed flowering
containing it and a method and kit for producing the plant,
where the cytotoxic gene product is herpes simplex virus
thymidine kinase.



10. Claims: partly: 13 and completely: 20-22

A method for producing a transgenic plant having suppressed
flowering by introducing a nucleic acid molecule comprising a
floral organ selective regulatory element operatively linked
to a nucleotide sequence encoding a cytotoxic gene product
by Agrobacterium-mediated transformation.

Applicant's or agent's file reference
1 133.01 1W01 


FOR FURTHER ACTION: See Notification of Transmittal of International Preliminary Examination Report (Form PCT/IPEA/416)

International application No.
PCT/US99/24407

International filing date (day/month/year): 15/10/1999
Priority date (day/month/year): 16/10/1998


International Patent Classification (IPC) or national classification and IPC 
C12N 15/00 


Applicant 

THE REGENTS OF THE UNIVERSITY OF CALIFORNIA et al. 



1. This international preliminary examination report has been prepared by this International Preliminary Examining Authority
and is transmitted to the applicant according to Article 36. 



2. This REPORT consists of a total of 8 sheets, including this cover sheet. 

□ This report is also accompanied by ANNEXES, i.e. sheets of the description, claims and/or drawings which have 
been amended and are the basis for this report and/or sheets containing rectifications made before this Authority 
(see Rule 70.16 and Section 607 of the Administrative Instructions under the PCT). 

These annexes consist of a total of sheets. 



3. This report contains indications relating to the following items: 



I ☒ Basis of the report
II □ Priority
III ☒ Non-establishment of opinion with regard to novelty, inventive step and industrial applicability
IV ☒ Lack of unity of invention
V ☒ Reasoned statement under Article 35(2) with regard to novelty, inventive step or industrial applicability; citations and explanations supporting such statement
VI □ Certain documents cited
VII □ Certain defects in the international application
VIII ☒ Certain observations on the international application



Date of submission of the demand 



15/05/2000 



Date of completion of this report 



09.02.2001 



Name and mailing address of the international 
preliminary examining authority: 
European Patent Office 

D-80298 Munich 
Tel. +49 89 2399 - 0 Tx: 523656 epmu d 

Fax: +49 89 2399 - 4465 



Authorized officer 
Heimann-Pohl, B 

Telephone No. +49 89 2399 8713 

INTERNATIONAL PRELIMINARY
EXAMINATION REPORT 



International application No. PCT/US99/24407 



I. Basis of the report

1. This report has been drawn on the basis of (substitute sheets which have been furnished to the receiving Office in
response to an invitation under Article 14 are referred to in this report as "originally filed" and are not annexed to
the report since they do not contain amendments (Rules 70.16 and 70.17).):

Description, pages: 

1-30 as originally filed 

Claims, No.: 

1-33 as originally filed

Drawings, sheets:

1/43-43/43 as originally filed

Sequence listing part of the description, pages:

1-21, filed with the letter of 22.02.00

2. With regard to the language, all the elements marked above were available or furnished to this Authority in the 
language in which the international application was filed, unless otherwise indicated under this item. 

These elements were available or furnished to this Authority in the following language: , which is:

□ the language of a translation furnished for the purposes of the international search (under Rule 23.1(b)).

□ the language of publication of the international application (under Rule 48.3(b)). 

□ the language of a translation furnished for the purposes of international preliminary examination (under Rule 
55.2 and/or 55.3). 

3. With regard to any nucleotide and/or amino acid sequence disclosed in the international application, the 
international preliminary examination was carried out on the basis of the sequence listing: 

□ contained in the international application in written form. 

□ filed together with the international application in computer readable form. 
☒ furnished subsequently to this Authority in written form.

☒ furnished subsequently to this Authority in computer readable form.

☒ The statement that the subsequently furnished written sequence listing does not go beyond the disclosure in
the international application as filed has been furnished.

☒ The statement that the information recorded in computer readable form is identical to the written sequence
listing has been furnished. 

4. The amendments have resulted in the cancellation of: 

□ the description, pages:
□ the claims, Nos.:
□ the drawings, sheets:



5. □ This report has been established as if (some of) the amendments had not been made, since they have been 

considered to go beyond the disclosure as filed (Rule 70.2(c)): 

(Any replacement sheet containing such amendments must be referred to under item 1 and annexed to this 
report.) 

6. Additional observations, if necessary: 

III. Non-establishment of opinion with regard to novelty, inventive step and industrial applicability

1. The questions whether the claimed invention appears to be novel, to involve an inventive step (to be non-obvious),
or to be industrially applicable have not been examined in respect of:

□ the entire international application.
☒ claims Nos. 3, 19-22, 30, 33.



□ the said international application, or the said claims Nos. relate to the following subject matter which does
not require an international preliminary examination (specify):

□ the description, claims or drawings (indicate particular elements below) or said claims Nos. are so unclear
that no meaningful opinion could be formed (specify):

□ the claims, or said claims Nos. are so inadequately supported by the description that no meaningful opinion
could be formed.

☒ no international search report has been established for the said claims Nos. 3, 19-22, 30, 33.

2. A meaningful international preliminary examination report cannot be carried out due to the failure of the nucleotide 
and/or amino acid sequence listing to comply with the standard provided for in Annex C of the Administrative 
Instructions: 

□ the written form has not been furnished or does not comply with the standard. 

□ the computer readable form has not been furnished or does not comply with the standard. 

IV. Lack of unity of invention 

1. In response to the invitation to restrict or pay additional fees the applicant has:

□ restricted the claims.
☒ paid additional fees.

□ paid additional fees under protest. 

□ neither restricted nor paid additional fees. 

2. □ This Authority found that the requirement of unity of invention is not complied with and chose, according to Rule
68.1, not to invite the applicant to restrict or pay additional fees.

3. This Authority considers that the requirement of unity of invention in accordance with Rules 13.1, 13.2 and 13.3 is 

□ complied with. 

☒ not complied with for the following reasons:
see separate sheet 

4. Consequently, the following parts of the international application were the subject of international preliminary 
examination in establishing this report: 

□ all parts. 

☒ the parts relating to claims Nos. 1, 2, 4-18, 23-29, 31, 32.

V. Reasoned statement under Article 35(2) with regard to novelty, inventive step or industrial applicability; 
citations and explanations supporting such statement 

1. Statement 



Novelty (N) Yes: Claims 2, 4-7, 14-18, 25-29, 31, 32

No: Claims 1, 8-13, 23, 24

Inventive step (IS) Yes: Claims

No: Claims 2, 4-7, 14-18, 25-29, 31, 32

Industrial applicability (IA) Yes: Claims 1, 2, 4-18, 23-29, 31, 32

No: Claims



2. Citations and explanations 
see separate sheet 



VIII. Certain observations on the international application 

The following observations on the clarity of the claims, description, and drawings or on the question whether the 
claims are fully supported by the description, are made: 
see separate sheet 

1). The present application relates to transgenic plants, specifically trees important in
wood production, containing a nucleic acid molecule including a floral organ
selective regulatory element linked to a nucleotide sequence encoding a cytotoxic
gene product.

2). Unity (Box IV)

The IPEA agrees with the ISA in regard to reasons for the non-unity objection 
which are the following: 

Methods for modifying plants to increase vegetative growth of commercially 
valuable plant structures at the expense of non-essential and non-commercial 
structures are already known in the art. In WO9813503 (document D1) a method 
for producing a transgenic plant with enhanced vegetative growth is presented. In 
the method the plant is produced by introducing an expression cassette containing 
a structural gene for Barnase RNase under the control of a promoter that will 
cause specific expression of the gene in floral tissue. This leads to a decreased 
floral tissue growth and an increased vegetative tissue growth in the transgenic 
plant. 

Due to the fact that nucleic acid constructs comprising a floral organ selective 
regulatory element operatively linked to a nucleotide sequence encoding a 
cytotoxic gene product, and the use of said construct to produce a transgenic 
plant is already known from D1, the problem underlying the present application 
(as far as it had been subject to the Search Report) is the provision of further floral 
tissue specific promoters and transgenic plants containing them. 

The requisite unity of invention (Rule 13.1 PCT) therefore no longer exists
inasmuch as a technical relationship involving one or more of the same or
corresponding special technical features in the sense of Rule 13.2 PCT does not
exist between the subject-matter of the following groups of dependent claims:

1. Claims: partly: 1, 2, 8-14, 23-25, 31, 32 and completely: 4, 15, 26

An isolated nucleic acid molecule comprising a floral organ selective
regulatory element operatively linked to a nucleotide sequence encoding a
cytotoxic gene product, a transgenic plant containing said molecule and
characterized by suppressed flowering and a method and kit for producing
said plant, where the floral organ selective regulatory element is AGL2 (SEQ
ID NO: 1).

2. Claims: partly: 1, 2, 8-14, 23-25, 31, 32 and completely: 5, 16, 27

An isolated nucleic acid molecule comprising a floral organ selective 
regulatory element operatively linked to a nucleotide sequence encoding a 
cytotoxic gene product, a transgenic plant containing said molecule and 
characterized by suppressed flowering and a method and kit for producing 
said plant, where the floral organ selective regulatory element is AGL4 (SEQ 
ID NO: 2). 

3. Claims: partly: 1, 2, 8-14, 23-25, 31, 32 and completely: 6, 17, 28

An isolated nucleic acid molecule comprising a floral organ selective 
regulatory element operatively linked to a nucleotide sequence encoding a 
cytotoxic gene product, a transgenic plant containing said molecule and 
characterized by suppressed flowering and a method and kit for producing 
said plant, where the floral organ selective regulatory element is AGL9 (SEQ 
ID NO: 3). 

4. Claims: partly: 1, 2, 8-14, 23-25, 31, 32 and completely: 7, 18, 29

An isolated nucleic acid molecule comprising a floral organ selective 
regulatory element operatively linked to a nucleotide sequence encoding a 
cytotoxic gene product, a transgenic plant containing said molecule and 
characterized by suppressed flowering and a method and kit for producing 
said plant, where the floral organ selective regulatory element is AP1 (SEQ 
ID NO: 10). 

In response to an invitation to pay additional fees or to restrict the claims, the
applicant has chosen to pay additional fees for inventions 2-4.

3). Prior Art

D1: WO 98 13503 A (F B INVESTMENTS PTY LTD; TEASDALE ROBERT
DIXON (AU)) 2 April 1998 (1998-04-02)

D2: MA ET AL.: 'AGL1-AGL6, an Arabidopsis gene family with similarity to floral 
homeotic and transcription factor genes' GENES & DEVELOPMENT, vol. 5, no. 3, 
March 1991 (1991-03), pages 484-495, XP000905073 

D3: MANDEL ET AL.: 'Arabidopsis thaliana MADS-box (AGL9) mRNA, complete
cds.' EMBL SEQUENCE DATABASE, 29 August 1997 (1997-08-29),
XP002145697 HEIDELBERG DE

D4: WO 97 27287 A (UNIV CALIFORNIA) 31 July 1997 (1997-07-31) 

D1 relates to a method of modifying a plant (specifically trees) to increase
vegetative growth. The method involves a tissue specific promoter, expressed
during development of both male and female plant reproductive structures,
controlling a cytotoxic gene (Barnase). The tissue specific promoters used in D1,
PrMADS1, 2 and 3, belong to the family of MADS-box genes showing homology to
Arabidopsis AGL-2, AGL-4 and AGL-6. D1 further discloses a method for
Agrobacterium-mediated transformation (pages 4-13, Examples 3 and 6).

D2 reports that AGL-2 and AGL-4 are preferentially expressed in flowers. In situ
RNA hybridization experiments with AGL-1 and AGL-2 showed that their mRNAs 
are detected in some floral organs but not in others. 

D3 discloses that AGL-9 MADS-box gene is expressed in young flower primordia. 

D4 discloses that AP1 and LFY contribute to establishing the floral meristem 
(paragraph bridging page 11- page 12). 

4). Novelty (Box V)

The transgenic plant (claims 1, 11, 12, 23), the tissue derived from said plant of
claim 1 (claims 8-10), the method for producing said plant (claim 23) and the
isolated nucleic acid molecule of claim 24 lack novelty because, owing to the
breadth of said claims, subject matter disclosed in D1 falls within the scope of these
claims (Art. 33(2) PCT).

None of the documents discloses the subject matter of claims 2, 4-7, 14-18,
25-29, 31 and 32. The subject matter of these claims is therefore considered to be
novel.

5). Inventive Step (Box V)

The problem underlying the present application, enhancing vegetative growth,
can be derived from D1, which is regarded as the closest prior art document.
The solution is the use of a tissue specific (floral organ specific) regulatory
element, which is also disclosed in D1. The solution of the present application is
the use of alternative floral organ specific elements, AGL-2 (invention 1), AGL-4
(invention 2), AGL-9 (invention 3) and AP1 (invention 4). These alternative floral
organ specific elements are all known in the art as being flower specific. The
skilled person, bearing in mind the teaching of D1 and faced with the problem of
providing alternative methods for enhancing vegetative growth in trees, would have
combined the teaching of D1 with either D2, D3 or D4 without the need of
inventive skill and with a reasonable expectation of success. Consequently, the
subject matter of claims 2, 4-7, 14-18, 25-29, 31 and 32 lacks an inventive step as
required by Art. 33(3) PCT.

6). Support, Art. 6 PCT (Box III)

The description of the present application relates to floral organ specific elements
in connection with GUS expression (see Examples); thus the transgenic plants of
claims 1, 2, 4-7 have not been reduced to practice.

International application No. PCT/US99/24407



I. Basis of the report

1. This report has been drawn on the basis of (substitute sheets which have been furnished to the receiving Office in
response to an invitation under Article 14 are referred to in this report as "originally filed" and are not annexed to
the report since they do not contain amendments (Rules 70.16 and 70.17).):

Description, pages: 

1-30 as originally filed 

Claims, No.: 

1-33 as originally filed

Drawings, sheets: 

1/43-43/43 as originally filed 

Sequence listing part of the description, pages: 

1-21, filed with the letter of 22.02.00

2. With regard to the language, all the elements marked above were available or furnished to this Authority in the
language in which the international application was filed, unless otherwise indicated under this item.

These elements were available or furnished to this Authority in the following language: , which is:

□ the language of a translation furnished for the purposes of the international search (under Rule 23.1(b)).

□ the language of publication of the international application (under Rule 48.3(b)). 

□ the language of a translation furnished for the purposes of international preliminary examination (under Rule 
55.2 and/or 55.3). 

3. With regard to any nucleotide and/or amino acid sequence disclosed in the international application, the 
international preliminary examination was carried out on the basis of the sequence listing:

□ contained in the international application in written form.

□ filed together with the international application in computer readable form.
☒ furnished subsequently to this Authority in written form.

☒ furnished subsequently to this Authority in computer readable form.

☒ The statement that the subsequently furnished written sequence listing does not go beyond the disclosure in
the international application as filed has been furnished.

☒ The statement that the information recorded in computer readable form is identical to the written sequence
listing has been furnished.

4. The amendments have resulted in the cancellation of: 

□ the description, pages:
□ the claims, Nos.:
□ the drawings, sheets:

5. □ This report has been established as if (some of) the amendments had not been made, since they have been
considered to go beyond the disclosure as filed (Rule 70.2(c)):

(Any replacement sheet containing such amendments must be referred to under item 1 and annexed to this
report.)

6. Additional observations, if necessary:

III. Non-establishment of opinion with regard to novelty, inventive step and industrial applicability

1. The questions whether the claimed invention appears to be novel, to involve an inventive step (to be non-obvious),
or to be industrially applicable have not been examined in respect of:

□ the entire international application.
☒ claims Nos. 3, 19-22, 30, 33.



□ the said international application, or the said claims Nos. relate to the following subject matter which does
not require an international preliminary examination (specify):

□ the description, claims or drawings (indicate particular elements below) or said claims Nos. are so unclear
that no meaningful opinion could be formed (specify):

□ the claims, or said claims Nos. are so inadequately supported by the description that no meaningful opinion
could be formed.

☒ no international search report has been established for the said claims Nos. 3, 19-22, 30, 33.

2. A meaningful international preliminary examination report cannot be carried out due to the failure of the nucleotide
and/or amino acid sequence listing to comply with the standard provided for in Annex C of the Administrative
Instructions:

□ the written form has not been furnished or does not comply with the standard. 

□ the computer readable form has not been furnished or does not comply with the standard. 



IV. Lack of unity of invention

1. In response to the invitation to restrict or pay additional fees the applicant has:

□ restricted the claims.
☒ paid additional fees.

□ paid additional fees under protest.

□ neither restricted nor paid additional fees.

2. □ This Authority found that the requirement of unity of invention is not complied with and chose, according to Rule
68.1, not to invite the applicant to restrict or pay additional fees.

3. This Authority considers that the requirement of unity of invention in accordance with Rules 13.1, 13.2 and 13.3 is

□ complied with.

☒ not complied with for the following reasons:
see separate sheet

4. Consequently, the following parts of the international application were the subject of international preliminary
examination in establishing this report:

□ all parts.

☒ the parts relating to claims Nos. 1, 2, 4-18, 23-29, 31, 32.

V. Reasoned statement under Article 35(2) with regard to novelty, inventive step or industrial applicability; 
citations and explanations supporting such statement 

1. Statement

Novelty (N) Yes: Claims 2, 4-7, 14-18, 25-29, 31, 32

No: Claims 1, 8-13, 23, 24

Inventive step (IS) Yes: Claims

No: Claims 2, 4-7, 14-18, 25-29, 31, 32

Industrial applicability (IA) Yes: Claims 1, 2, 4-18, 23-29, 31, 32

No: Claims

2. Citations and explanations 
see separate sheet 

1). The present application relates to transgenic plants, specifically trees important in
wood production, containing a nucleic acid molecule including a floral organ
selective regulatory element linked to a nucleotide sequence encoding a cytotoxic
gene product.

2). Unity (Box IV)

The IPEA agrees with the ISA in regard to reasons for the non-unity objection
which are the following: 

Methods for modifying plants to increase vegetative growth of commercially 
valuable plant structures at the expense of non-essential and non-commercial 
structures are already known in the art. In WO9813503 (document D1) a method
for producing a transgenic plant with enhanced vegetative growth is presented. In 
the method the plant is produced by introducing an expression cassette containing 
a structural gene for Barnase RNase under the control of a promoter that will 
cause specific expression of the gene in floral tissue. This leads to a decreased 
floral tissue growth and an increased vegetative tissue growth in the transgenic 
plant. 

Because nucleic acid constructs comprising a floral organ selective
regulatory element operatively linked to a nucleotide sequence encoding a
cytotoxic gene product, and the use of such constructs to produce a transgenic
plant, are already known from D1, the problem underlying the present application
(as far as it was subject to the Search Report) is the provision of further floral
tissue specific promoters and transgenic plants containing them.

The requisite unity of invention (Rule 13.1 PCT) therefore no longer exists,
inasmuch as a technical relationship involving one or more of the same or
corresponding special technical features in the sense of Rule 13.2 PCT does not
exist between the subject-matter of the following groups of dependent claims:

1. Claims: partly: 1, 2, 8-14, 23-25, 31, 32 and completely: 4, 15, 26

An isolated nucleic acid molecule comprising a floral organ selective
regulatory element operatively linked to a nucleotide sequence encoding a
cytotoxic gene product, a transgenic plant containing said molecule and
characterized by suppressed flowering and a method and kit for producing
said plant, where the floral organ selective regulatory element is AGL2 (SEQ
ID NO: 1).

2. Claims: partly: 1, 2, 8-14, 23-25, 31, 32 and completely: 5, 16, 27

An isolated nucleic acid molecule comprising a floral organ selective 
regulatory element operatively linked to a nucleotide sequence encoding a 
cytotoxic gene product, a transgenic plant containing said molecule and 
characterized by suppressed flowering and a method and kit for producing 
said plant, where the floral organ selective regulatory element is AGL4 (SEQ 
ID NO: 2). 

3. Claims: partly: 1, 2, 8-14, 23-25, 31, 32 and completely: 6, 17, 28

An isolated nucleic acid molecule comprising a floral organ selective 
regulatory element operatively linked to a nucleotide sequence encoding a 
cytotoxic gene product, a transgenic plant containing said molecule and 
characterized by suppressed flowering and a method and kit for producing 
said plant, where the floral organ selective regulatory element is AGL9 (SEQ 
ID NO: 3). 

4. Claims: partly: 1, 2, 8-14, 23-25, 31, 32 and completely: 7, 18, 29

An isolated nucleic acid molecule comprising a floral organ selective 
regulatory element operatively linked to a nucleotide sequence encoding a 
cytotoxic gene product, a transgenic plant containing said molecule and 
characterized by suppressed flowering and a method and kit for producing 
said plant, where the floral organ selective regulatory element is AP1 (SEQ 
ID NO: 10). 



In response to an invitation to pay additional fees or to restrict the claims, the
applicant has chosen to pay additional fees for inventions 2-4.

3). Prior Art

D1: WO 98 13503 A (F B INVESTMENTS PTY LTD; TEASDALE ROBERT
DIXON (AU)) 2 April 1998 (1998-04-02)

D2: MA ET AL.: 'AGL1-AGL6, an Arabidopsis gene family with similarity to floral
homeotic and transcription factor genes' GENES & DEVELOPMENT, vol. 5, no. 3,
March 1991 (1991-03), pages 484-495, XP000905073

D3: MANDEL ET AL.: 'Arabidopsis thaliana MADS-box (AGL9) mRNA, complete
cds.' EMBL SEQUENCE DATABASE, 29 August 1997 (1997-08-29),
XP002145697 HEIDELBERG DE

D4: WO 97 27287 A (UNIV CALIFORNIA) 31 July 1997 (1997-07-31) 

D1 relates to a method of modifying a plant (specifically trees) to increase
vegetative growth. The method involves a tissue specific promoter, expressed
during development of both male and female plant reproductive structures,
controlling a cytotoxic gene (Barnase). The tissue specific promoters used in D1,
PrMADS1, 2 and 3, belong to the family of MADS-box genes showing homology to
Arabidopsis AGL-2, AGL-4 and AGL-6. D1 further discloses a method for
Agrobacterium-mediated transformation (pages 4-13, Examples 3 and 6).

D2 reports that AGL-2 and AGL-4 are preferentially expressed in flowers. In situ
RNA hybridization experiments with AGL-1 and AGL-2 showed that their mRNAs
are detected in some floral organs but not in others.

D3 discloses that AGL-9 MADS-box gene is expressed in young flower primordia. 

D4 discloses that AP1 and LFY contribute to establishing the floral meristem
(paragraph bridging pages 11 and 12).

4). Novelty (Box V)

The transgenic plant (claims 1, 11, 12, 23), the tissue derived from said plant of
claim 1 (claims 8-10), the method for producing said plant (claim 23) and the
isolated nucleic acid molecule of claim 24 lack novelty because, owing to the
breadth of said claims, subject matter disclosed in D1 falls within the scope of these
claims (Art. 33(2) PCT).

None of the documents discloses the subject matter of claims 2, 4-7, 14-18,
25-29, 31 and 32. The subject matter of these claims is therefore considered to be
novel.

5). Inventive Step (Box V)

The problem underlying the present application, enhancing vegetative growth,
can be derived from D1, which is regarded as the closest prior art document.
The general solution, the use of a tissue specific (floral organ specific) regulatory
element, is also disclosed in D1. The solution of the present application is
the use of alternative floral organ specific elements: AGL-2 (invention 1), AGL-4
(invention 2), AGL-9 (invention 3) and AP1 (invention 4). These alternative floral
organ specific elements are all known in the art as being flower specific. The
skilled person, bearing in mind the teaching of D1 and faced with the problem of
providing alternative methods for enhancing vegetative growth in trees, would have
combined the teaching of D1 with either D2, D3 or D4 without the need for
inventive skill and with a reasonable expectation of success. Consequently, the
subject matter of claims 2, 4-7, 14-18, 25-29, 31 and 32 lacks an inventive step as
required by Art. 33(3) PCT.

6). Support, Art. 6 PCT (Box III)

The description of the present application relates to floral organ specific elements
in connection with GUS expression (see Examples); thus the transgenic plants of
claims 1, 2, 4-7 have not been reduced to practice.
Applicant

THE REGENTS OF THE UNIVERSITY OF CALIFORNIA et al. 



This International Search Report has been prepared by this International Searching Authority and is transmitted to the applicant 
according to Article 18. A copy is being transmitted to the International Bureau. 



This International Search Report consists of a total of _ sheets.



[X] It is also accompanied by a copy of each prior art document cited in this report. 



1. Basis of the report

a. With regard to the language, the international search was carried out on the basis of the international application in the
language in which it was filed, unless otherwise indicated under this item.

[ ] the international search was carried out on the basis of a translation of the international application furnished to this
Authority (Rule 23.1(b)).

b. With regard to any nucleotide and/or amino acid sequence disclosed in the international application, the international search
was carried out on the basis of the sequence listing:
[ ] contained in the international application in written form.

filed together with the international application in computer readable form,
furnished subsequently to this Authority in written form,
furnished subsequently to this Authority in computer readable form.



□ 
E 



the statement that the subsequently furnished written sequence listing does not go beyond the disclosure in the 
international application as filed has been furnished. 

[X] the statement that the information recorded in computer readable form is identical to the written sequence listing has been 
furnished 

2. [ ] Certain claims were found unsearchable (See Box I).

3. [X] Unity of invention is lacking (see Box II). 

4. With regard to the title, 

[X] the text is approved as submitted by the applicant.

[ ] the text has been established by this Authority to read as follows:



With regard to the abstract,

[X] the text is approved as submitted by the applicant.

[ ] the text has been established, according to Rule 38.2(b), by this Authority as it appears in Box III. The applicant may,
within one month from the date of mailing of this international search report, submit comments to this Authority.

The figure of the drawings to be published with the abstract is Figure No.

[ ] as suggested by the applicant. None of the figures.

[ ] because the applicant failed to suggest a figure.

[ ] because this figure better characterizes the invention.
Box I Observations where certain claims were found unsearchable (Continuation of item 1 of first sheet)

This International Search Report has not been established in respect of certain claims under Article 17(2)(a) for the following reasons:
1. [ ] Claims Nos.:
International Application No. PCT/US 99/24407

FURTHER INFORMATION CONTINUED FROM PCT/ISA/210
3. [ ] Claims Nos.:
This International Searching Authority found multiple (groups of)
An isolated nucleic acid molecule comprising a floral organ
selective regulatory element operatively linked to a
nucleotide sequence encoding a cytotoxic gene product, a
transgenic plant characterised by suppressed flowering
containing it and a method and kit for producing the plant,
where the floral organ selective regulatory element is AGL2
(Seq Id No 1).



2. Claims: partly: 1, 2, 8-14, 23-25, 31, 32 and completely: 5,
16, 27



An isolated nucleic acid molecule comprising a floral organ 
selective regulatory element operatively linked to a
nucleotide sequence encoding a cytotoxic gene product, a 
transgenic plant characterised by suppressed flowering 
containing it and a method and kit for producing the plant, 
where the floral organ selective regulatory element is AGL4 
(Seq Id No 2). 



3. Claims: partly: 1, 2, 8-14, 23-25, 31, 32 and completely: 6,
17, 28



An isolated nucleic acid molecule comprising a floral organ 
selective regulatory element operatively linked to a
nucleotide sequence encoding a cytotoxic gene product, a 
transgenic plant characterised by suppressed flowering 
containing it and a method and kit for producing the plant, 
where the floral organ selective regulatory element is AGL9 
(Seq Id No 3). 



4. Claims: partly: 1, 2, 8-14, 23-25, 31, 32 and completely: 7,
18, 29



An isolated nucleic acid molecule comprising a floral organ 
selective regulatory element operatively linked to a
nucleotide sequence encoding a cytotoxic gene product, a
transgenic plant characterised by suppressed flowering
containing it and a method and kit for producing the plant,
where the floral organ selective regulatory element is AP1
(Seq Id No 10).
5. Claims: partly: 1, 3, 8-13, 19, 23, 24, 30, 31, 33

An isolated nucleic acid molecule comprising a floral organ
selective regulatory element operatively linked to a
nucleotide sequence encoding a cytotoxic gene product, a
transgenic plant characterised by suppressed flowering
containing it and a method and kit for producing the plant,
where the cytotoxic gene product is diphtheria toxin A chain.



6. Claims: partly: 1, 3, 8-13, 19, 23, 24, 30, 31, 33 

An isolated nucleic acid molecule comprising a floral organ 
selective regulatory element operatively linked to a
nucleotide sequence encoding a cytotoxic gene product, a
transgenic plant characterised by suppressed flowering
containing it and a method and kit for producing the plant,
where the cytotoxic gene product is RNase T1.



7. Claims: partly: 1, 3, 8-13, 19, 23, 24, 30, 31, 33 

An isolated nucleic acid molecule comprising a floral organ 
selective regulatory element operatively linked to a
nucleotide sequence encoding a cytotoxic gene product, a 
transgenic plant characterised by suppressed flowering 
containing it and a method and kit for producing the plant, 
where the cytotoxic gene product is Barnase RNase. 



8. Claims: partly: 1, 3, 8-13, 19, 23, 24, 30, 31, 33 

An isolated nucleic acid molecule comprising a floral organ 
selective regulatory element operatively linked to a
nucleotide sequence encoding a cytotoxic gene product, a 
transgenic plant characterised by suppressed flowering 
containing it and a method and kit for producing the plant, 
where the cytotoxic gene product is ricin toxin A chain. 



9. Claims: partly: 1, 3, 8-13, 19, 23, 24, 30, 31, 33 

An isolated nucleic acid molecule comprising a floral organ 
selective regulatory element operatively linked to a
nucleotide sequence encoding a cytotoxic gene product, a 
transgenic plant characterised by suppressed flowering 
containing it and a method and kit for producing the plant, 
where the cytotoxic gene product is herpes simplex virus 
thymidine kinase. 



10. Claims: partly: 13 and completely: 20-22

A method for producing a transgenic plant having suppressed
flowering by introducing a nucleic acid molecule comprising a
floral organ selective regulatory element operatively linked
to a nucleotide sequence encoding a cytotoxic gene product
by Agrobacterium-mediated transformation.
Sa256076_0003.Dna

EMPLN:AC002396 (XP-002145696)

ID   AC002396 standard; DNA; PLN; 122358 bp.
SV   AC002396.1
DT   07-AUG-1997 (Rel. 52, Created)
DT   ...-2000 (Rel. ..., Last updated, Version 5)
DE   Arabidopsis thaliana chromosome I BAC F3I6 genomic sequence, complete
DE   sequence.
KW   HTG.
OS   Arabidopsis thaliana (thale cress)
RN   [1]
RP   1-122358
RA   Federspiel N.A., Palm C.J., Conway A.B., Kurtz D.B., Conway A.R., Au M.,
RA   Araujo R., Buehler E., Dewar K., ...
RA   Ecker J., Theologis A., Davis R.W.;
RT   ;
RL   Unpublished.
RN   [2]
RP   1-122358
RA   Federspiel N.A., Palm C.J., Conway A.B., Kurtz D.B., Conway A.R., Au ...
RA   Ecker J., Theologis A., Davis R.W.;
RT   ;
RL   Submitted (31-JUL-1997) to the EMBL/GenBank/DDBJ databases.
RL   Biochemistry, Stanford University / DNA Sequencing and Technology
RL   Center, 855 California Avenue, Palo Alto, CA 94304, USA
RN   [3]
RP   1-122358
RA   Federspiel N.A., Palm C.J., Conway A.B., Kurtz D.B., Conway A.R., Au ...
RA   Araujo R., Buehler E., Dewar K., Feng J., Kim C., Li Y., ...
RA   ... Shinn P., Sun H., ...
RA   Theologis A., Davis R.W.;
RT   ;
RL   Submitted (..-JAN-1998) to the EMBL/GenBank/DDBJ databases.
RL   Stanford DNA Sequencing and Technology Center, Stanford University,
RL   855 California Avenue, Palo Alto, CA 94304, USA
RN   [4]
RP   1-122358
RA   Federspiel N.A., Palm C.J., Conway A.B., Kurtz D.B., Conway A.R., Au ...
RA   Theologis A., Davis R.W.;
RT   ;
RL   ... California Avenue, Palo Alto, CA 94304, USA

SCORES   Init1: 73170  Initn: 73170  Opt: 74436  z-score: 90851.8  E(): 0

99.8% identity in 14943 bp overlap 
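
The identity figure above is the share of aligned positions at which the two sequences carry the same base. The following is a minimal sketch of that calculation for one pair of aligned rows, not the search program's own code; the function name `percent_identity`, the `-` gap convention and the short example strings are assumptions made for illustration.

```python
def percent_identity(query: str, subject: str) -> float:
    """Percent of aligned columns where both rows carry the same base.

    Both rows must have equal length; '-' marks a gap. Columns where both
    rows are gaps are skipped, and a gap never counts as a match.
    """
    if len(query) != len(subject):
        raise ValueError("aligned rows must have the same length")
    compared = 0
    matches = 0
    for q, s in zip(query.upper(), subject.upper()):
        if q == "-" and s == "-":
            continue
        compared += 1
        if q == s and q != "-":
            matches += 1
    if compared == 0:
        raise ValueError("no aligned columns to compare")
    return 100.0 * matches / compared


# Hypothetical ten-base rows with a single mismatch.
example = percent_identity("ACGTACGTAC", "ACGTACGAAC")

# Number of non-identical positions implied by the reported figures.
implied_mismatches = round(14943 * (1 - 0.998))
print(example, implied_mismatches)
```

With the reported 99.8% identity over a 14943 bp overlap, this suggests roughly 30 non-identical positions across the full alignment.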

                                       14939     14929     14919
Sa256076_000                               TCTTGGGAATATTTTTTTGGCAGAAACAAA
                                           ||||||||||||||||||||||||||||||
AC002396     TTGGATTAGGCTTCTATGGGTTTAAGATGTTCTTGGGAATATTTTTTTGGCAGAAACAAA
                  66420     66430     66440     66450     66460     66470

         14909     14899     14889     14879     14869     14859
Sa256076_000 ACTTTTCGTCGAGGTGTTTTTCCACCACTGGAGATTCTGTTCAAGGATCCCCTGAGTCAG
             ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
AC002396     ACTTTTCGTCGAGGTGTTTTTCCACCACTGGAGATTCTGTTCAAGGATCCCCTGAGTCAG
                  66480     66490     66500     66510     66520     66530

         14849     14839     14829     14819     14809     14799
Sa256076_000 ATTGTGTTGTGCCTCAGXSJCSt'CAAGTTTAAGCGGCTTGATAAAACCGCTAAGCATATTA
             |||||||||||||||||       ||||||||||||||||||||||||||||||||||||||
AC002396     ATTGTGTTGTGCCTCAGXGAAl'CAAGTTTAAGCGGCTTGATAAAACCGCTAAGCATATTA
                  66540     66550     66560     66570     66580     66590

         14789     14779     14769     14759     14749     14739
Sa256076_000 TGCAGGTACTACAACTCGAACTTGACACTTGTATTTAGATATATCATGTAGACAAGGAAG
             ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
AC002396     TGCAGGTACTACAACTCGAACTTGACACTTGTATTTAGATATATCATGTAGACAAGGAAG
                  66600     66610     66620     66630     66640     66650

         14729     14719     14709     14699     14689     14679
Sa256076_000 CACCGATGTCACAATCCTCTGCAGATTGTAGACAAGGAAGCAGTTGAGGAAGTGAGAACT
             ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
AC002396     CACCGATGTCACAATCCTCTGCAGATTGTAGACAAGGAAGCAGTTGAGGAAGTGAGAACT
                  66660     66670     66680     66690     66700     66710

         14669     14659     14649     14639     14629     14619
Sa256076_000 CTTAGAGAGATTCCAGAGATAAAGCCTGGTTACATTGTGCAGCTAAAAGTGGTAACTACT
             ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
AC002396     CTTAGAGAGATTCCAGAGATAAAGCCTGGTTACATTGTGCAGCTAAAAGTGGTAACTACT
                  66720     66730     66740     66750     66760     66770

         14609     14599     14589     14579     14569     14559
Sa256076_000 GTTGGCCTAGGTTGTAATTTCTTTTTCTTGGGTGAGTAAGATCAAACTATCTTTCATCTT
             ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
AC002396     GTTGGCCTAGGTTGTAATTTCTTTTTCTTGGGTGAGTAAGATCAAACTATCTTTCATCTT
                  66780     66790     66800     66810     66820     66830

         14549     14539     14529     14519     14509     14499
Sa256076_000 CTAAGTGAAACCTTTCATTTACTTTGTTTGAACAGGAAGTGCCTGAGAACAAGAGGCGTG
             ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
AC002396     CTAAGTGAAACCTTTCATTTACTTTGTTTGAACAGGAAGTGCCTGAGAACAAGAGGCGTG
                  66840     66850     66860     66870     66880     66890

         14489     14479     14469     14459     14449     14439
Sa256076_000 TATCAATCGTAAAAGGCGTCGTCATAGCAAGGCGTAATGCTGGTCTCAACTCAACATTTA
             |||||||||||||||       ||||||||||||||||||||||||||||||||||||||
AC002396     TATCAATCGTAAAAGKjCCTcTH'CATAGCAAGGCGTAATGCTGGTCTCAACTCAACATTTA
                  66900     66910     66920     66930     66940     66950

         14429     14419     14409     14399     14389     14379
Sa256076_000 GAATTAGAAGGCTTGTTGCTGGAGTGGGCGTCGAATCCATGTTCCCCTTGTAAGTAACTT
             ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
AC002396     GAATTAGAAGGCTTGTTGCTGGAGTGGGCGTCGAATCCATGTTCCCCTTGTAAGTAACTT
                  66960     66970     66980     66990     67000     67010

         14369     14359     14349     14339     14329     14319
Sa256076_000 CAAGCTTCAAGCTTGTGCAGCCACTTTAATCTTTTTCTCTCAGACTTTTGTATATGACCA
             ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
AC002396     CAAGCTTCAAGCTTGTGCAGCCACTTTAATCTTTTTCTCTCAGACTTTTGTATATGACCA
                  67020     67030     67040     67050     67060     67070

         14309     14299     14289     14279     14269     14259
Sa256076_000 TTAAGAGCCATTATTATAATCAAGAGCCACCATGTTTTGTATGTTCTCTTTGTTTGCCTT
             ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
AC002396     TTAAGAGCCATTATTATAATCAAGAGCCACCATGTTTTGTATGTTCTCTTTGTTTGCCTT
                  67080     67090     67100     67110     67120     67130

         14249     14239     14229     14219     14209     14199
Sa256076_000 GGTCATGGAAAATACTGATGTAAGTGGAGGTGTTTTGGTGACAGGTATTCCCCAAACCTG
             ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
AC002396     GGTCATGGAAAATACTGATGTAAGTGGAGGTGTTTTGGTGACAGGTATTCCCCAAACCTG
                  67140     67150     67160     67170     67180     67190

         14189     14179     14169     14159     14149     14139
Sa256076_000 AGGGTGATTAAGGTGGTGGACAAGAAGAAAGTAAGAAGAGCCAAGCTTTATTACCTCAGG
             ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
AC002396     AGGGTGATTAAGGTGGTGGACAAGAAGAAAGTAAGAAGAGCCAAGCTTTATTACCTCAGG
                  67200     67210     67220     67230     67240     67250

         14129     14119     14109     14099     14089     14079
Sa256076_000 GAAAAGGTCAATGCTCTCAAGAAGCATTAACAGCCTTAATAAGAATTAGCAGTCTACTCT
             ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
AC002396     GAAAAGGTCAATGCTCTCAAGAAGCATTAACAGCCTTAATAAGAATTAGCAGTCTACTCT
                  67260     67270     67280     67290     67300     67310

         14069     14059     14049     14039     14029     14019
Sa256076_000 TGTCTTTAAGAATTGAATTTGTGTAATCGTGAATCTCTTCAATTCTCTTGATAAGCCAAT
             ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
AC002396     TGTCTTTAAGAATTGAATTTGTGTAATCGTGAATCTCTTCAATTCTCTTGATAAGCCAAT
                  67320     67330     67340     67350     67360     67370

         14009     13999     13989     13979     13969     13959
Sa256076_000 CATCACAAGCAGGTTAAATTGATCTTTCATGAGTTCACCTCTCTGTTTTTGTTTCTACGT
             ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
AC002396     CATCACAAGCAGGTTAAATTGATCTTTCATGAGTTCACCTCTCTGTTTTTGTTTCTACGT
                  67380     67390     67400     67410     67420     67430

         13949     13939     13929     13919     13909     13899
Sa256076_000 TGAACTCTGTTTTTTGAGTGTTTCCCTTCTCAGTTTCTTATAAAAAGATTACCCTGAAAA
             ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
AC002396     TGAACTCTGTTTTTTGAGTGTTTCCCTTCTCAGTTTCTTATAAAAAGATTACCCTGAAAA
                  67440     67450     67460     67470     67480     67490

         13889     13879     13869     13859     13849     13839
Sa256076_000 TAGCATATTGAACACCTCAACATAAAAGAAACCTCAACAAAGAAACTATACACTCTCTTT
             ||||||    |||||||||||||||||||||||    |||||||||||||||||||||||
AC002396     TAGCATj^pSAACACCTCAACATAAAAGAAACCI^BCAAAGAAACTATACACTCTCTTT
                  67500     67510     67520     67530     67540     67550

         13829     13819     13809     13799     13789     13779
Sa256076_000 TAGTCTCTTATACACCAATTCATTGTACACCTCAACATAGAAGCATAGCATATTTTTGAA
             ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
AC002396     TAGTCTCTTATACACCAATTCATTGTACACCTCAACATAGAAGCATAGCATATTTTTGAA
                  67560     67570     67580     67590     67600     67610

         13769     13759     13749     13739     13729     13719
Sa256076_000 TGTCATGTTAATCAAACAAAACTCTAATGTTTGCAAAAGAAAACTAAAAACACTCTCTTT
             ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
AC002396     TGTCATGTTAATCAAACAAAACTCTAATGTTTGCAAAAGAAAACTAAAAACACTCTCTTT
                  67620     67630     67640     67650     67660     67670

         13709     13699     13689     13679     13669     13659
Sa256076_000 TCGTGAGAAGCTCTACTTTCTTTTTCGTCTCTTATGGTATATTACAGTCTTCCAGTACTC
             ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
AC002396     TCGTGAGAAGCTCTACTTTCTTTTTCGTCTCTTATGGTATATTACAGTCTTCCAGTACTC
                  67680     67690     67700     67710     67720     67730

         13649     13639     13629     13619     13609     13599
Sa256076_000 TCCTTAGCTATCTGGCTATCTCTAGACTCCAGTAGGTGGATCAGTCAAAATTTGTGAAAG
             ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
AC002396     TCCTTAGCTATCTGGCTATCTCTAGACTCCAGTAGGTGGATCAGTCAAAATTTGTGAAAG
                  67740     67750     67760     67770     67780     67790

         13589     13579     13569     13559     13549     13539
Sa256076_000 CTCCATGACTAAATCGTCATGACCCTGAACAAGTGCAACGACCTGCAAAAGATAACAAAG
             ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
AC002396     CTCCATGACTAAATCGTCATGACCCTGAACAAGTGCAACGACCTGCAAAAGATAACAAAG
                  67800     67810     67820     67830     67840     67850

         13529     13519     13509     13499     13489     13479
Sa256076_000 CAAATCAAACCAAAGAACTAAGAAAGTGCAACACACAACACAAAGTATCCTACCAAGAAG
             ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
AC002396     CAAATCAAACCAAAGAACTAAGAAAGTGCAACACACAACACAAAGTATCCTACCAAGAAG
                  67860     67870     67880     67890     67900     67910

         13469     13459     13449     13439     13429     13419
Sa256076_000 CCTACCTCCTGATACGCATCATGTTTGGACTTGTTTCCCTCACCGTACATTCTCATTATC
             ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
AC002396     CCTACCTCCTGATACGCATCATGTTTGGACTTGTTTCCCTCACCGTACATTCTCATTATC
                  67920     67930     67940     67950     67960     67970

         13409     13399     13389     13379     13369     13359
Sa256076_000 TGAAGAACTGAGTCAACTACTTGACTACCATCGCCCTGAAACCTTGCCTGCACATGGACA
             ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
AC002396     TGAAGAACTGAGTCAACTACTTGACTACCATCGCCCTGAAACCTTGCCTGCACATGGACA
                  67980     67990     68000     68010     68020     68030

         13349     13339     13329     13319     13309     13299
Sa256076_000 ATTTTGTTACTTAAACCACAAGTTAACATATAGGAAACATCAAATGGTAACTATAAGCAC
             ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
AC002396     ATTTTGTTACTTAAACCACAAGTTAACATATAGGAAACATCAAATGGTAACTATAAGCAC
                  68040     68050     68060     68070     68080     68090

         13289     13279     13269     13259     13249     13239
Sa256076_000 TCTATACCTTAAGCTTCGTGGTAAAACTCGTTGTAGCTGAAAGGAAGACACAGAAACCAA
             |||||||||   |||||||||||||||||||||||||   ||||||||||||||||||||
AC002396     TCTATACCT^PGCTTCGTGGTAAAACTCGTTGTAGCWKAAGGAAGACACAGAAACCAA
                  68100     68110     68120     68130     68140     68150

         13229     13219     13209     13199     13189     13179
Sa256076_000 AAAGCAGATTCAAGTGATCTTTCATGAGTTCCTCCACCCTAGCAATGACACAAGCGGCAT
             ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
AC002396     AAAGCAGATTCAAGTGATCTTTCATGAGTTCCTCCACCCTAGCAATGACACAAGCGGCAT
                  68160     68170     68180     68190     68200     68210

         13169     13159     13149     13139     13129     13119
Sa256076_000 CGACTCTACAAAGAGACTTAGTAACATCAGAATCAATTCACATAATGTGTTTGTTCTGTA
             ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
AC002396     CGACTCTACAAAGAGACTTAGTAACATCAGAATCAATTCACATAATGTGTTTGTTCTGTA
                  68220     68230     68240     68250     68260     68270

         13109     13099     13089     13079     13069     13059
Sa256076_000 GACTGAAAAAATCAAACATAACATGGAGAAAAGAACTAATACATACCTGCGAGCTTTAAA
             ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
AC002396     GACTGAAAAAATCAAACATAACATGGAGAAAAGAACTAATACATACCTGCGAGCTTTAAA
                  68280     68290     68300     68310     68320     68330

         13049     13039     13029     13019     13009     12999
Sa256076_000 ATCTTTCAAGAGCTTAAGCATTTCCCCATATTTTGCAGGTTCATCATGAAAGGCTTCCTT
             ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
AC002396     ATCTTTCAAGAGCTTAAGCATTTCCCCATATTTTGCAGGTTCATCATGAAAGGCTTCCTT
                  68340     68350     68360     68370     68380     68390

         12989     12979     12969     12959     12949     12939
Sa256076_000 CACAGCAATAAGGTATGAAGTCGCATCATCTATGGTAGGCTCCGGACGTGGTAGACTTCT
             ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
AC002396     CACAGCAATAAGGTATGAAGTCGCATCATCTATGGTAGGCTCCGGACGTGGTAGACTTCT
                  68400     68410     68420     68430     68440     68450

         12929     12919     12909     12899     12889     12879
Sa256076_000 TCCAACCACCTTTGGTGGTACTGGTACGCTTCTTCCAACCACCTTATGAAACTCTTCGCT
             ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
AC002396     TCCAACCACCTTTGGTGGTACTGGTACGCTTCTTCCAACCACCTTATGAAACTCTTCGCT
                  68460     68470     68480     68490     68500     68510

         12869     12859     12849     12839     12829     12819
Sa256076_000 AGCCTCGGGAGGTATGGTTAACTGATACTCAGGTGGAAGTAAGACATTTAAACCAAGACA
             ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
AC002396     AGCCTCGGGAGGTATGGTTAACTGATACTCAGGTGGAAGTAAGACATTTAAACCAAGACA
                  68520     68530     68540     68550     68560     68570

         12809     12799     12789     12779     12769     12759
Sa256076_000 CAGACTCGGGTGATCTTTCAAGAGGTCCTGCATGCGTGCACTGAAAGTAGGCATACCGAT
             ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
AC002396     CAGACTCGGGTGATCTTTCAAGAGGTCCTGCATGCGTGCACTGAAAGTAGGCATACCGAT
                  68580     68590     68600     68610     68620     68630

         12749     12739     12729     12719     12709     12699
Sa256076_000 CCTACAAAGAGAAAAATAACATCAGAATCAATTCTCACATGTGTCATTGTTCTTTAGACT
             ||||||||||||||||||| ||||||||||||||||||||||||||||||||||||||||
AC002396     CCTACAAAGAGAAAAATAAGATCAGAATCAATTCTCACATGTGTCATTGTTCTTTAGACT
                  68640     68650     68660     68670     68680     68690

         12689     12679     12669     12659     12649     12639
Sa256076_000 GTTTAAGAGCAAACCAAACACACCTAGGAGCTGTAAAACCACGAAAGAGTCTTACAAACA
             ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
AC002396     GTTTAAGAGCAAACCAAACACACCTAGGAGCTGTAAAACCACGAAAGAGTCTTACAAACA
                  68700     68710     68720     68730     68740     68750

12629 12619 12609 12599 12589 12579 

Sa2 5 6 0 7 6„0 0 0 TTCCAAATCTTGCAGGTTCCATTTCCTGCAATGCGACCTCCACAGCATTGACGTATGCCT 

MM II MM II II I! II Mil I MIM IIMM II MM M llllllll IIIIII II (I 

Ac 0 0 2 3 9 6 TTCCAAATCTTGCAGGTTCCATTTCCTGCAATGCGACCTCCACAGCATTGACGTATGCCT 
68760 68770 68780 68790 68800 68810 



12569 12559 12549 12539 12529 12519 

S a 2 5 6 0 7 6_0 0 0 TTACATCTTCCAAAGATGACGCCGGTGATAAACTTCCTCCAGCCATGGTGTGAAATTGAC 

M II M I Mill M II M MM II 1 1 M 1 1 Ml III Ml I M II M I MMM MM Ml 

AcO 0 2 3 9 6 TTAC^TCTTCCAAAGAT"GACGCCGGTGATAAACTTCCTCCAGCCATGGTGTGAAATTGAC 
68820 68830 68840 68850 68860 68870 

12509 12499 12489 12479 12469 12459 

Sa2 56 07 6_00 0 CTTATCCATGGTTCA7LAGATAAAGATTTGTGTTCATCAAAACAGATCAGATTGATCATTG 

II M II II M II M MM 1 1 II I M Mill I I II M M ! Ml M M M II 1 1 II II I Ml 

Ac002396 CTTATCCATGGTTCAAAGATAAAGATTTGTGTTCATCAAAACAGATCAGATTGATCATTG 

68880 68890 68900 68910 68920 68930 

12449 12439 12429 12419 12409 12399 

Sa2 5 6 0 7 6_0 0 0 AAGCCTTTCTTGTATCTCAAGAAACTTAATCCATAACAACAAGAAGATCAGACCAGTTCG 

II M I II M M I 1 1 II 1 1 M 1 1 II II 1 1 M M M II II 1 1 M I 1 1 1 1 1 I M I M II M M 

AcO 0 2 3 9 6 AAGCCTTTCTTGTATCTCAAGAAACTTAATCCATAACAACAAGAAGATCAGACCAGTTCG 
68940 68950 68960 68970 68980 68990 

12389 12379 12369 12359 12349 12339 

S a 2 5 6 0 7 6_0 0 0 TCGATATGGATGATTCATGCAAAGTTCATCGAACAAGATCCTTAAGAACATCTAGAATCA 

I M MM M MM MM MM MM MM! M M III MM1 Ml Ml Mil MM MM 

Ac 0 0 2 3 9 6 TCGATATGGATGATTCATGCAAAGTTCATCGAACAAGATCCTTAAGAACATCTAGAATCA 
69000 69010 69020 69030 69040 69050 

12329 12319 12309 12299 12289 12279 

Sa25607 6_000 GTCTCGATTGAAAGAAAACAACAAGATGTGCTGATCAAGAAACTAGAAGATAATCATATA 

MM MMM Mill Ml M IMMMMI II II I! M Mi Ml II ! Ml M M I 1 MM 

AC002396 GTCTCGATTGAAAGAAAACAACAAGATGTGCTGATCAAGAAACTAGAAGATAATCATATA
69060 69070 69080 69090 69100 69110 

12269 12259 12249 12239 12229 12219 

Sa25607 6_00 0 CGAAATTAAGAAATTCGATCGTTAACCTAATCGTCACGTCACATGATTTCACATTTCAGA 

I III II II II II II I III II MINI MM Mil MMM II I Ml III I Mil III Ml 

Ac 0 0 2 3 9 6 CGAAATTAAGAAATTCGATCGTTAACCTAATCGTCACGTCACATGATTTCACATTTCAGA 
69120 69130 69140 69150 69160 69170 

12209 12199 12189 12179 12169 12159 

Sa256076_000 TGAACGAAAAAGGAATTGAAAAGAAGAAGAAGAAGAAGATAATCACCTTACGAGTAAGCT

MM II MM M MM i Mill II II M I ! MM M I III II MM II 1 1 II II MMM 

AC002396 TGAACGAAAAAGGAATTGAAAAGAAGAAGAAGAAGAAGATAATCACCTTACGAGTAAGCT
69180 69190 69200 69210 69220 69230 

12149 12139 12129 12119 12109 12099 

Sa2 56076_000 TCGTCTTCGCAAGTGAGAGGCTTGAAGATTCCCAGAAACAAACGCACTCTGTTTTGAGCT 

MMM MMM MM II MM It III I MM I II I MM II II Ml MMIM M MM 

Ac 0 0 2 3 9 6 TCGTCTTCGCAAGTGAGAGGCTTGAAGATTCCCAGAAACAAACGCACTCTGTTTTGAGCT 
69240 69250 69260 69270 69280 69290 

12089 12079 12069 12059 12049 12039 

Sa256076_000 CTTCGGTTTTTCTTTTCCAGCAAAACCTCAGAGACTGATTCGAGTATTATTCACGAGTCT

             ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||

AC002396     CTTCGGTTTTTCTTTTCCAGCAAAACCTCAGAGACTGATTCGAGTATTATTCACGAGTCT
69300 69310 69320 69330 69340 69350 

12029 12019 12009 11999 11989 11979 

Sa256076_000 ATATTTATAGTAGGAATCTATCATATTTGACTTCATTTTATTATTGTCAACTTTTGTTTG 
IIIIIMIIIII llllllllllll IIIIIIMIMIIIIIIIIIIIII llllll MM II 
Ac 0 0 2 3 9 6 ATATTTATAGTAGGAATCTATCATATTTGACTTCATTTTATTATTGTCAACTTTTGTTTG 
69360 69370 69380 69390 69400 69410 

11969 11959 11949 11939 11929 11919 

Sa256076_000 CATTGATCACATAATTAGCATAAGCTTATAAAACTATAATTGGTAAAATCTGTGAAACTG

II 1 1 1 1 M M 1 1 1 1 M 1 1 II 1 1 1 1 1 1 1 1 1 ! M II M I II 1 1 1 1 1 1 1 M M I M M 1 1 1 1 1 

Ac002396 CATTGATCACATAATTAGCATAAGCTTATAAAACTATAATTGGTAAAATCTGTGAAACTG 

69420 69430 69440 69450 69460 69470 

11909 11899 11889 11879 11869 11859 

S a 2 5 6 0 7 6_0 0 0 AAATTTTAAAAACTTTCTATATATCAAAGATGATGAGAATTTTTTTTTTGTGATGATTGG 

I II II II I II II II I I I I II I I II I I I II II I I I I I I I II I I II I II I M I I I M I II I I 
Ac 0 02 3 9 6 AAATTTTAAAAACTTTCTATATATCAAAGATGATGAGAATTTTTTTTTTGTGATGATTGG 

69480 69490 69500 69510 69520 69530 

11849 11839 11829 11819 11809 11799 

Sa2 5 6 07 6_00 0 AATACAGATATTTCTATTTATTTTAGTCCAGACATATATAAATTAGATATTTAAAATATT 

I II II II II M II I I II II I II II I II M 1 1 II I II I II II II I I II 1 1 II I II 1 1 II 1 1 

Ac 0 0 2 3 9 6 AATACAGATATTTCTATTTATTTTAGTCCAGACATATATAAATTAGATATTTAAAATATT 
69540 69550 69560 69570 69580 69590 

11789 11779 11769 11759 11749 11739 

Sa2 5 6 0 7 6_0 0 0 TTCATTAACTTCGTAAGATAATTCCAAGATAAAGTAATTAAAATAAATCGTAAATAATTC 

I II 1 1 1 1 i 1 1 II 1 1 II II II 1 1 1 1 1 M I I II 1 1 1 1 1 1 M 1 1 1 1 II II II 1 1 1 1 1 1 1 II II 

AcO 02 3 9 6 TTCATTAACTTCGTAAGATAATTCCAAGATAAAGTAATTAAAATAAATCGTAAATAATTC 
69600 69610 69620 69630 69640 69650 

11729 11719 11709 11699 11689 11679 

Sa2 5 6 07 6_0 0 0 ATATAACATTTACATAAATTATAAAGCATATTTGTGGTTGAGAAAAATAATAATTATAAA 

II I II I I I I I I I II I I I II II I I I I I I I I M I I II I I I II II I I II I I II I I II I II I M 
Ac002396 ATATAACATTTACATAAATTATAAAGCATATTTGTGGTTGAGAAAAATAATAATTATAAA 

69660 69670 69680 69690 69700 69710 

11669 11659 11649 11639 11629 11619 

Sa2 5 6 07 6_0 0 0 GCATATTTGTAAATGGATTTACTTGATTTTGGTTTGGTTTGCTCTGCTTTTTTTGGGCCC 

I f 1 1 i I f 1 1 1 f I ( 1 1 1 f M 1 1 1 1 1 1 1 1 1 1 1 ( 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 f i I M 1 1 1 1 1 1 1 1 1 

AcO 0 2 3 9 6 GCATATTTGTAAATGGATTTACTTGATTTTGGTTTGGTTTGCTCTGCTTTTTTTGGGCCC 
69720 69730 69740 69750 69760 69770 

11609 11599 11589 11579 11569 11559 

Sa2 5 6 07 6_00 0 AATTATAGGGATGAGTTTGGGCTCTGTTCCTCGTTTATTGTTTCTGGTGTGAAGTGTGAA 

M 1 1 1 1 1 1 1 1 1 1 I II 1 1 1 i 1 1 M 1 1 1 1 1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 M II 1 1 

AcO 0 2 3 9 6 AATTATAGGGATGAGTTTGGGCTCTGTTCCTCGTTTATTGTTTCTGGTGTGAAGTGTGAA 
69780 69790 69800 69810 69820 69830 

11549 11539 11529 11519 11509 11499 

Sa2 5 6 07 6_00 0 TTTTTTTTCCGGTGTTTTCTGGGAAACTACATGGCCCAAATCTGTATTCTAGAAGATGAA 

1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 M 1 1 1 1 1 1 M I II II II 11 1 II i I II 1 1 1 1 1 11 1 1 1 1 1 M 1 1 1 1 1 

Ac002396 TTTTTTTTCCGGTGTTTTCTGGGAAACTACATGGCCCAAATCTGTATTCTAGAAGATGAA 
69840 69850 69860 69870 69880 69890 



11489 11479 11469 11459 11449 11439 
Sa256076_000 CTAATTi^BAGGACAAATATTTTACCAAGTACAi^^TCTCTGAACTGAACTTAAACCT

* ™. * 111111 ™ 1 1 1 1 1 1 1 1 1 1 1 i 1 1 i M I I I I I I I ITTI I I I I I I I I I I I I I I I I I I I I I I 

AcO 0 2 3 9 6 CTAATTAACAAGGACAAATATTTTACCAAGTACAATAATCTCTGAACTGAACTTAAACCT 
69900 69910 69920 69930 69940 69950 

11429 11419 11409 11399 11389 11379 

Sa2 5 6 07 6_0 0 0 CTCCACAAAAGGAACTATACACTATATAGCCAATTAGTGTAATTGTTAAAAATATATGTT 

I Ml II I Mill II Ml III II I Mill II I III ! I MM IMIIMIII MM MM I I 

AcO 0 2 3 9 6 CTCCACAAAAGGAACTATACACTATATAGCCAATTAGTGTAATTGTTAAAAATATATGTT 

69960 69970 69980 69990 70000 70010 

11369 11359 11349 11339 11329 11319 

Sa2 5 6 0 7 6_0 0 0 CATATATTGATGAATCTTTATGATATATTTAAAAAATGATCAATTGTACAAAAGTTATTT 

M MM II Ml I MM Ml IIIMIMI II II M MIMI II MM II MIMI MMM 

AC002396 CATATATTGATGAATCTTTATGATATATTTAAAAAATGATCAATTGTACAAAAGTTATTT

70020 70030 70040 70050 70060 70070 

11309 11299 11289 11279 11269 11259

Sa256076_000 AGTTTTTTTTTTTATAAATATAGTTGACATATTACTATTATTTTGAAAAACAATCCGAAT

IMM II I M M I I M II II MIMI Mil II II M I I MM II MIMI II M M Ml I 

Ac 0 0 2 3 9 6 AGTTTTTTTTTTTATAAATATAGTTGACATATTACTATTATTTTGAAAAACAATCCGAAT 
70080 70090 70100 70110 70120 70130 

11249 11239 11229 11219 11209 11199 

Sa2 5 6 0 7 6_0 0 0 AACCCTAGTTACTAAAACAAACAAGAAACAAAATATATGATCAATGAGTATGAGAAGGTA 

M M M M M M II! Ill M !IMM M II M M II II II II M II MM I IMMM I I 

AcO 02 3 9 6 AACCCTAGTTACTAAAACAAACAAGAAACAAAATATATGATCAATGAGTATGAGAAGGTA 
70140 70150 70160 70170 70180 70190 

11189 11179 11169 11159 11149 11139 

Sa2 5 6 0 7 6__0 0 0 GTACCTTCCTCTTGGAACTCTACAAACTCGTTCATAATCATTAATATTCATATTCACGGG 

M II M MM I Ml M I Mill MM I I II M M I I II II II MM Mill! M MMM 

AcO 0 2 3 9 6 GTACCTTCCTCTTGGAACTCTACAAACTCGTTCATAATCATTAATATTCATATTCACGGG 
70200 70210 70220 70230 70240 70250 

11129 11119 11109 11099 11089 11079 

Sa2 5 6 0 7 6_ 0 0 0 ATGGTGACAATGTTCTTGTGCATTGTTAGCACATGCATATAGCCGACACTGTTGATTAAC 

M M II II M III III III MMIIMM Ml I MM I II M M MM M II II II MM 

AcO 0 2 3 9 6 ATGGTGACAATGTTCTTGTGCATTGTTAGCACATGCATATAGCCGACACTGTTGATTAAC 
70260 70270 70280 70290 70300 70310

11069 11059 11049 11039 11029 11019 

Sa256076_000 TGTTGAAATGGCGTTAGAGGATATTTGTCATATGGTATCTCGTCCCTTTGTTCACTGTTG

             ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||

AC002396     TGTTGAAATGGCGTTAGAGGATATTTGTCATATGGTATCTCGTCCCTTTGTTCACTGTTG

70320 70330 70340 70350 70360 70370 

11009 10999 10989 10979 10969 10959 

Sa2 5 6 0 7 6_ 0 0 0 GCAACCTAAACTCAAATAATAAAAAAGGGTTGTTCTTCAGACAAGAATTACGATCTCTTA 

M II i Mill II II MM II MIMI MIMI M II I I IMMI II MM MMM M M 

AC002396 GCAACCTAAACTCAAATAATAAAAAAGGGTTGTTCTTCAGACAAGAATTACGATCTCTTA

70380 70390 70400 70410 70420 70430 

10949 10939 10929 10919 10909 10899 

Sa2 5 6 07 6„0 0 0 GTTACTTGACGGAAATTTTCTCCCGGCTTTTCTTGCATAGAAACTTCGTAAGTCATAGAA 

M I Ml II I IMM II MMIIMM! 1 1! IM I II M II M MIMI MM II il 

AcO 02396 GTTACTTGACGGAAATTTTCTCCCGGCTTTTCTTGCATAGAAACTTCGTAAGTCATAGAA 
70440 70450 70460 70470 70480 70490 

10889 10879 10869 10859 10849 10839 

Sa256076_000 TATGCAATAOHRCTTATATAATTTGTCAACATATACaiBiAGACTTACATCAAAAAGAT

II MINI MM II MM MM 1 1 II INI II II II 1 1 MUM 1 1 II NIM Ml Ml I 

AC002396 TATGCAATACAAACTTATATAATTTGTCAACATATACATCAAGACTTACATCAAAAAGAT

70500 70510 70520 70530 70540 70550 

10829 10819 10809 10799 10789 10779 

S a2 5 6 0 7 6_0 0 0 CACAACTCACATCTTTAAGTAAAAACTTGTCTGCTAACTTCACAGTGTGACTTTCATTAT 
I I II II I I II I I II I II I II I I II I I M I I II I I II I I I M I I I I I I I M II II I I M I I 
AC002396 CACAACTCACATCTTTAAGTAAAAACTTGTCTGCTAACTTCACAGTGTGACTTTCATTAT 
70560 70570 70580 70590 70600 70610 

10769 10759 10749 10739 10729 10719 

Sa2 5 6 0 7 6_0 0 0 AGCCTACTGTGGTTCCCTGCATAATATACAGTTTTACCTGAAACTCCTTAACAATATCTG 

1 1 1 1 II II II I II 1 1 II I M II II I II M I I II 1 1 1 I M M II I 1 1 1 1 I M II I II 1 1 M 

AC002396 AGCCTACTGTGGTTCCCTGCATAATATACAGTTTTACCTGAAACTCCTTAACAATATCTG 
70620 70630 70640 70650 70660 70670 

10709 10699 10689 10679 10669 10659 

Sa2 5 6 0 7 6_0 0 0 TGTTGAAAAAAATGGCTCCAAAAAGAACCAAAACTGGTAACAAGTTCCCCACATTTTTTA 

I I I M I ! 1 1 1 1 1 11 1 1 M I M I 1 1 M 1 11 1 1 M M M 1 1 1 1 1 1 1 1 1 1 1 1 M M 1 1 1 1 M I 

AcO 0 2 3 9 6 TGTTGAAAAAAATGGCTCCAAAAAGAACCAAAACTGGTAACAAGTTCCCCACATTTTTTA 
70680 70690 70700 70710 70720 70730 

10649 10639 10629 10619 10609 10599 

Sa256076_000 TTCGAATATTTTTATATGTTTAACAAAGCCATGAGTTTTTGGTTTGTTCGGCGCATGTGG 

I II I I I I I II I I M I I II I II I I I I I II I I II I I II I I I I I I I I II I I M II I I I I I I II 
AcO 02 39 6 TTCGAATATTTTTATATGTTTAACAAAGCCATGAGTTTTTGGTTTGTTCGGCGCATGTGG 
70740 70750 70760 70770 70780 70790 

10589 10579 10569 10559 10549 10539 

Sa2 5 6 0 7 6_0 0 0 TTATGCATGCTTTGGTTTGTTCTATAGTTCATGGGTTCAAATCCACCCTTCAACATTATT 

I M I II 1 1 II M I I 1 1 1 1 II 1 1 M 1 1 1 1 M M II II II M 1 1 1 1 I M I II 1 1 I II II M I 

AcO 0 2 3 9 6 TTATGCATGCTTTGGTTTGTTCTATAGTTCATGGGTTCAAATCCACCCTTCAACATTATT 
70800 70810 70820 70830 70840 70850 

10529 10519 10509 10499 10489 10479 

Sa2 5607 6_000 ATTATTCTTATTTTTAAAATGCTTAAAACAACATAGTTTTGATAATTTGACTTATACTGT 
I I I I II I I I M I I I I M I I I I I I I I I II I I II I I I I I I II I I II I I II II II II II I I I I 
Ac00239 6 ATTATTCTTATTTTTAAAATGCTTAAAACAACATAGTTTTGATAATTTGACTTATACTGT 
70860 70870 70880 70890 70900 70910 

10469 10459 10449 10439 10429 10419 

S a2 5 6 0 7 6_0 0 0 TTCTAAAATCTATACACGAATAAAGGTTATACGGGTTACTTTTGATATTGCGGTATTGAA 

I ( M 1 1 1 1 1 1 1 1 M i I I II I M 1 1 1 1 1 M 1 1 1 ( I 1 1 II M ! I M 1 1 1 1 1 1 1 1 M I M 1 1 1 

AcO 0 2 3 9 6 TTCTAAAATCTATACACGAATAAAGGTTATACGGGTTACTTTTGATATTGCGGTATTGAA 
70920 70930 70940 70950 70960 70970 

10409 10399 10389 10379 10369 10359 

Sa256076_000 TTGCATTTATTTAAAATGAAAGGTTAATTTGTTTGTTTTTTTTGCACAAAACCCAAGTTA

             ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||

AC002396     TTGCATTTATTTAAAATGAAAGGTTAATTTGTTTGTTTTTTTTGCACAAAACCCAAGTTA

70980 70990 71000 71010 71020 71030 

10349 10339 
ID   AF015552 standard; RNA; PLN; 906 BP.
XX
DE   Arabidopsis thaliana MADS-box (AGL9) mRNA, complete cds.
AGL1-AGL6, an Arabidopsis gene
family with similarity to floral homeotic
and transcription factor genes

Hong Ma, 1 Martin F. Yanofsky, 2 and Elliot M. Meyerowitz 3 

Division of Biology, California Institute of Technology, Pasadena, California 91125 USA 

The predicted products of floral homeotic genes, AGAMOUS (AG) from Arabidopsis thaliana and DEFICIENS
A (DEF A) from Antirrhinum majus, have been shown previously to share strong sequence similarity with
transcription factors from humans (SRF) and yeast (MCM1). The conserved sequence between these proteins is 
localized within a domain known to be necessary for the DNA binding and for the dimerization of SRF. We 
have isolated six new genes from A. thaliana, AGL1-AGL6, which also have this conserved sequence motif. 
On the basis of the sequence comparison between the AG and AGL genes, they can be assigned to two 
subfamilies of a large gene family. RNA dot blot analysis indicates that five of these genes (AGL1, AGL2,
AGL4, AGL5, and AGL6) are preferentially expressed in flowers. In addition, in situ RNA hybridization
experiments with AGL1 and AGL2 show that their mRNAs are detected in some floral organs but not in
others. Our results suggest that these genes may act to control many steps of Arabidopsis floral 
morphogenesis. In contrast, the AGL3 gene is expressed in vegetative tissues as well as in flowers, suggesting 
that it functions in a broader range of tissues. We discuss possible roles of this gene family during the 
evolution of flowers. 

[Key Words: Floral-specific genes; flower development; gene family; MADS box; in situ hybridization]
Received October 30, 1990; revised version accepted December 28, 1990. 



Although flower development has been described in
some detail, very little is known about the molecular
machinery that controls cellular differentiation in
developing flowers. In recent years, the small mustard
Arabidopsis thaliana has been used increasingly for plant
molecular and genetic studies (Meyerowitz 1987, 1989), and
a number of Arabidopsis floral homeotic mutants have
been characterized (Koornneef 1987; Pruitt et al. 1987;
Bowman et al. 1988, 1989; Haughn and Somerville 1988;
Komaki et al. 1988; Kunst et al. 1989; Meyerowitz et al.
1989). Phenotypes of several homeotic mutants indicate
that they alter floral organ identities (Komaki et al. 1988;
Bowman et al. 1989; Kunst et al. 1989). One of these
homeotic genes is AGAMOUS (AG). Homozygous ag
mutant plants produce double flowers (Bowman et al.
1989; Meyerowitz et al. 1989). In the ag mutant flower,
while four sepals and four normal petals develop in the
outer two whorls, as in the wild type, six additional
petals occupy the wild-type positions of stamens. In
addition, a new flower appears in the position occupied in
wild type by the ovary. The pattern of 4 sepals
surrounding 10 (4 + 6) petals repeats until the whole flower has
~70 organs (Bowman et al. 1989). The AG gene has been
Corresponding author.



cloned recently (Yanofsky et al. 1990), and DNA
sequence analysis indicates that it encodes a protein that
shares striking similarity in its amino-terminal portion
with the DNA-binding domains of transcription factors
from humans (SRF; Norman et al. 1988) and yeast
(MCM1; Passmore et al. 1988), suggesting that the AG
protein is a transcription factor. Another yeast
regulatory gene, ARG80 (Dubois et al. 1987), also has the same
type of sequence motif.

Approximately a dozen genes have been defined
genetically to be required for normal floral morphogenesis in
Arabidopsis (Koornneef 1987). The complex process of
flower development is likely to require many more
regulatory proteins to coordinate the formation of floral
organs at the proper time and location. In Drosophila,
many of the regulatory proteins that control early
developmental fate share a conserved domain for similar
functions, e.g., DNA binding (Ingham 1988). By analogy, it is
possible that the conserved putative DNA-binding
domain of AG is shared by other regulators of flower
development. In fact, a recently cloned flower homeotic
gene from Antirrhinum majus (snapdragon), DEF A, also
encodes a protein with the same type of DNA-binding
domain (Sommer et al. 1990). This conserved motif has
since been called the MADS box (for MCM1, AG and
ARG80, DEF A, and SRF). Mutations in the DEF A gene
cause phenotypes in snapdragon flowers that are very
different from those of ag mutants: The petals are
replaced by sepals, and the stamens are replaced by
carpel-like tissues, while the outer sepals and inner carpels are
normal (Sommer et al. 1990).

In an effort to gain further understanding of Arabidopsis
flower development, we set out to isolate genes that
share sequence similarity with AG. Here we report the
isolation and characterization of six genes that share
substantial sequence similarity with AG and DEF A.
They are designated AGL1-AGL6 for AG-like. We
present the sequences of the AGL genes and their
expression patterns. The possible functions of this large family
of regulatory genes in flower development, and its
possible role in the evolution of the flower, are discussed.



Results 

Isolation of AGL genomic and cDNA clones 

The region of amino acid sequence similarity between
the AG protein (Yanofsky et al. 1990), the known
transcription factors SRF (Norman et al. 1988) and MCM1
(Passmore et al. 1988), and the yeast regulatory protein
ARG80 (Dubois et al. 1987) is localized within a
56-residue domain (the MADS box) in the amino-terminal
region of these proteins. A highly conserved octapeptide,
KKAYELSV, is found within the MADS box. A set of
degenerate oligonucleotides was generated based on this
octapeptide (see Materials and methods). Low-stringency
hybridization of an Arabidopsis genomic DNA blot with
this set of oligonucleotides as probes revealed ~20 bands
(data not shown). These oligonucleotides were then used
to probe a cosmid library (Yanofsky et al. 1990) made
from Arabidopsis nuclear DNA, and 46 clones were
isolated. Southern blot analysis showed that 12 of the
clones hybridized to an AG cDNA clone under moderate
stringency (data not shown). On the basis of patterns of
restriction fragments and hybridization with the AG
cDNA, we concluded that these 12 clones most likely
represent four genes, named AGL1-AGL4. This was
confirmed later by DNA sequencing (see below).
Representative cosmids were chosen for further analysis.

AGL1 and AGL2 genomic fragments were used to
probe a λgt10-based cDNA library constructed from
Arabidopsis floral poly(A)+ RNA (Yanofsky et al. 1990).
DNA sequence analysis (see below) revealed that among
the cDNA clones isolated with an AGL1 probe (probe 1,
Fig. 1B) there were not only clones for AGL1 but also for
AGL2 and for one additional gene, designated AGL5 (Fig.
1A). Similarly, AGL2 and AGL4 cDNA clones (Fig. 1A)
were isolated with an AGL2 probe (probe 2, Fig. 1B).
Because these clones hybridize to the AG gene at moderate
stringencies, we probed the cDNA library with an AG
cDNA at a moderate stringency (see Materials and
methods). Only moderate to weak positives were analyzed; a
total of 27 AGL clones, including AGL3 and another
Figure 1. (A) Maps of AGL cDNAs. (AG and AGL1,
AGL2, AGL4, and AGL6) Composite maps from two
cloned fragments, the ends of which are marked by
EcoRI sites (the cDNA and genomic sequences agree
with each other; for clone numbers, see Materials
and methods). All clones have EcoRI sites (not all are
shown) at both ends; only EcoRI, HindIII, and other
sites used for demarcation purposes are shown. The
symbols (■) and (O) indicate the positions of
translation initiation and termination codons,
respectively. The An signs represent poly(A) tails. The
solid bars indicate the position of the MADS boxes,
and the hatched bars indicate the position of the
conserved K boxes. The region at the carboxyl
terminus of AGL5, represented by the dashed line, is
from genomic sequence. The asterisk (*) in AGL5
indicates the position of sequence AGGCTT, one
base different from the HindIII site (AAGCTT)
found at this position in the other cDNAs. Enzyme
keys: (C) ScaI; (H) HindIII; (N) NdeI; (P) HpaI; (R)
EcoRI; (S) SspI; (X) XhoI. (B) AGL1, AGL2, AGL4, and
AGL5 gene structures (for clone numbers, see
Materials and methods). The boxes indicate exons (open
boxes represent untranslated regions), and the lines
between them represent introns. All of the introns
have the canonical donor and acceptor sites, GT and
AG, respectively. The boxes with dashed lines at the
5'-most portion of AGL4 and AGL5 represent
regions of uncertainty because of the lack of genomic
information; the hatched box at the 3' region of
AGL5 lacks cDNA confirmation. The bars above AGL1, AGL2, and AGL4 represent regions of the corresponding genes used as probes
to isolate cDNA clones. The dashed lines indicate the introns that are lacking in some genes; the other introns between the solid lines
connecting different genes have conserved positions.
gene, AGL6 (Fig. 1A), were isolated. AGL5 genomic
cosmids (probed with an AGL5 cDNA) and additional
AGL2 and AGL4 cosmids (probed with an AGL2 cDNA)
were isolated from the cosmid library. The 3' portions of
AGL1, AGL2, and AGL4 cDNAs were isolated
subsequently using gene-specific genomic fragments as
probes.

The AGL gene structures and nucleotide sequences 

We have determined the sequences of AGL cDNAs (Figs.
2-4; AGL3 cDNA sequence is incomplete and not
shown), as well as the entire genomic regions for AGL1
and AGL2 and most of the AGL4 and AGL5 genes. On
the basis of the comparison between cDNA and genomic
sequences, we have deduced the complete exon-intron
structures for AGL1 and AGL2 and nearly complete
structures for AGL4 and AGL5 (Fig. 1B). The intron
positions are largely, though not entirely, conserved in all
of the genes where the intron positions are known (Fig.
1B).
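
The exon-intron deduction described above rests on two checks: each proposed intron should begin with GT and end with AG, and removing the introns from the genomic sequence should reproduce the cDNA. The sketch below shows both checks on an invented toy gene; the sequence, the coordinates, and the function names are assumptions made for illustration and are not taken from the AGL genes.

```python
def has_canonical_splice_sites(genomic, introns):
    """Check each intron for a GT donor and an AG acceptor.

    introns is a list of (start, end) pairs, 0-based and half-open,
    so genomic[start:end] is the intron itself.
    """
    results = []
    for start, end in introns:
        intron = genomic[start:end].upper()
        ok = len(intron) >= 4 and intron.startswith("GT") and intron.endswith("AG")
        results.append(ok)
    return results


def splice(genomic, introns):
    """Remove the listed introns and return the joined exon sequence."""
    pieces = []
    position = 0
    for start, end in sorted(introns):
        pieces.append(genomic[position:start])
        position = end
    pieces.append(genomic[position:])
    return "".join(pieces)


if __name__ == "__main__":
    # Toy gene: exon "ATGGCT", intron "GTAAGTTTTCAG", exon "AAGTAA".
    toy_gene = "ATGGCT" + "GTAAGTTTTCAG" + "AAGTAA"
    toy_introns = [(6, 18)]
    print(has_canonical_splice_sites(toy_gene, toy_introns))
    print(splice(toy_gene, toy_introns))
```

Comparing the output of `splice` with an independently sequenced cDNA is the kind of agreement the text refers to when it says the cDNA and genomic sequences were compared.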

The AGL1, AGL4, and AGL6 cDNAs each contain a
large open reading frame (ORF), as well as 5'- and
3'-nontranslated regions and a poly(A) tail (Figs. 1A, 2-4).
Although the cDNA clones for AGL2 and AGL5 do not
include poly(A) tails (see Materials and methods), the
entire protein-coding regions for these two genes have
been identified (Figs. 1A, 2, and 3). The AGL5 cDNA
clone does not contain the termination codon for the
longest ORF, but comparison of the genomic sequence
matching the end of the AGL5 cDNA with the
carboxy-terminal sequences of AG and AGL1 suggests the
probable carboxyl terminus of the AGL5 protein (Fig. 2). The
sequence of the amino-terminal portion (including most
of the MADS box) of the AGL3 protein (Fig. 5B) has been
deduced from the cDNA sequence (data not shown).
Additional AGL3 protein sequence (Fig. 5B) has been
deduced from genomic sequence (data not shown) using the
canonical intron donor (GT) and acceptor (AG) sites. The 
proteins encoded by the AGL genes are small (28.2-28.8 
kD calculated molecular mass) and slightly basic, simi- 
Figure 2. AGL1 and AGL5 cDNA and deduced protein
sequences. The complete coding region of AGL1 is shown, and
only the nucleotides in AGL5 that are different from AGL1 are
shown below the AGL1 sequence. The amino acid sequences
are shown in boldface. The dashes in both nucleotide and amino
acid sequences indicate gaps introduced to allow the best
alignment. Where AGL1 and AGL5 amino acid residues are identical,
they are shown once; where they are different, the AGL1
residues are shown above the AGL5 residues. For AGL5, the DNA
sequence starting at nucleotide 782 and the last 12 amino acid
residues are from genomic sequences. The calculated molecular
masses for AGL1 and AGL5 are 28,337 and 28,158 daltons,
respectively. The potential phosphorylation sites [RXX(T/S)] and
glycosylation sites [NX(T/S)] are underlined. The positions of
introns shared by AGL1 and AGL5 are represented by the
number (#) sign above the AGL1 nucleotide sequence; the one
intron that is present only in AGL1 is indicated by the dollar ($)
sign. The untranslated regions for AGL1 and AGL5 are shown
separately, as indicated.
SYRBYLKIiKORYBNLQRQQRHLLOBDLtOPI* 120

t 

AATTCAAAGGAGTTAGAGCAGCTTGAGCGTCAACTGGACGGCTCTCTCAAGCAAGTT^ 836 

C A G C G T 78S 

N9KBLHQLBRQLD08LRQVR 8 IKTQYMLDQ ISO 

C ff 

CTCTCGGATCTTCAAAATAAAGAGCAAATGTTGCTTGAAACCAATAGAGCTTTGGCAATGAAGCTC 926 
T GGGTC TG C T A CCCA87 8 

LaDLQ H KBO M LL BT li R A !■ A MRL D DMIOVR 8 180 
O X DA 8. B H 

CATCATATG GG AGG ATGGGAAGG CGGTGAA CAGAATGTTACCTACGCGCATCATCAAGCTCAGTCTCAGGGACTATACCAGCCT 1010 

C AGGA T TCAA AG T GA C G T AT 968 

HH M ~OQWBGQ B_ 0 H * 1 Y X H 8 Q A 0 S Q Q X, Y Q P 208 

10 DOIAOPB 8 210 
* 

CTTGAATGCAATCCAACTCTGCAAATGGGGTATGATAATCCAGTATGCTCTGAGCAAATCACTGCGACAACACAAGCTCAGGCGCAGCCG 110 0 

TG C T T A AGCC G A GG T GGTG G T C A AA 1058 

L B C P T L Q OY PVCSBQ T Q Q Q 

O I 8H MAVVOSQ 240 

GG AAACG GTTAC ATTCC AG GATGGATGCTCTGAGAATCATGTACTGTGATGAAGCTCACCCACAAAAGACCTTATATAT ATATAAA GTAT 1190 

C CTC G GCGATACTTCTTCCCCCAATMAGATCTTAAGCAAGTACTGGTGGGGTCTTCGTGGT 1148 

OHOYI P O If If L find 248 

250 

( 2 ) AG ATACAAG AC TTGGATTTGTAG ACATAAG TGGC T AAT AT A A TCGTCC^ 1277 

G CTTAT ATTAA G AATTC 1294 
{ 4 ) GTGATCTTAGATCTTATGCATATG AATAATAAT GTTATTGCACAAGACTTTTGCTTTTGTAGA 1235 
CCTTCAACATCTCTCTTCTGTTTCAGGATTTGTTTC 1322 

S $ 
TAACATCG A CC ATG TC TC ATCTG G TG An 
$ S 

Figure 3. AGL2 and AGL4 cDNA and deduced protein sequences. The coding regions are shown in the same way as in Fig. 2, with
AGL2 sequences above AGL4 sequences. The dashes in both nucleotide and amino acid sequences indicate gaps. The calculated
molecular masses for AGL2 and AGL4 are 28,456 and 28,579 daltons, respectively. The 5'-untranslated regions are shown with AGL2
above AGL4 sequences, as indicated with (2) and (4). The two small ORFs are highlighted in boldface, and the flanking conserved
sequences are underlined. The potential phosphorylation sites and glycosylation sites of the predicted proteins are underlined. Intron
positions for both AGL2 and AGL4 are denoted with number (#) signs. The 3'-untranslated regions for the two cDNAs are shown
separately. In the 3'-untranslated region of AGL4, the dollar ($) signs indicate the positions of four observed polyadenylation sites, and
potential polyadenylation signals are underlined for both AGL2 and AGL4.



lar to the AG protein (Yanofsky et al. 1990). Table 1 
shows the percentages of identical residues between AG 
and the AGL proteins in different regions. Like AG, the 
AGL proteins all have the MADS box, as discussed be- 
low. 
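
Percent identities of the kind summarized in Table 1 are computed from aligned sequences by counting positions where the two residues agree. The sketch below is one common way to do this; it assumes that positions where either sequence has a gap are skipped, which may or may not be the convention used for Table 1 (the table is not reproduced here). The second example peptide is made up for illustration.

```python
def percent_identity(aligned_a, aligned_b, gap="-"):
    """Percent of identical residues over the ungapped aligned positions."""
    if len(aligned_a) != len(aligned_b):
        raise ValueError("aligned sequences must have equal length")
    compared = 0
    matches = 0
    for x, y in zip(aligned_a.upper(), aligned_b.upper()):
        if x == gap or y == gap:
            continue  # assumption: gapped columns are not counted
        compared += 1
        if x == y:
            matches += 1
    if compared == 0:
        return 0.0
    return 100.0 * matches / compared


if __name__ == "__main__":
    # KKAYELSV is the conserved octapeptide; KKAHEISV is a hypothetical variant.
    print(percent_identity("KKAYELSV", "KKAHEISV"))
```

Applying the same function separately to each region of an alignment (for example the MADS box versus the carboxy-terminal region) gives per-region figures that can be compared across pairs of proteins.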



The regions in the AGL2 and AGL4 cDNAs 5' of the
long ORFs each have a short ORF beginning with an
ATG codon (Fig. 3). These short ORFs differ by only one
nucleotide and potentially encode the identical
heptapeptide MFLCVCI. The sequences flanking these short
ATTTATCGTGTAC 13
GATACTTTATTCCTTTTATCTATTCTTGAAAAAAAGTTACCAATT 58 
CTTG AG AAGAAGAAGAAATCAG AATCAAGAG AAGG AGAGAGAAAG 103 

ATGGGAAGAGGGAGAGTGGAGATGAAGAGGATAGAGAACAAGATT 1 48 
MORORVBMiRIBBKI 15 

AATAGACAAGTGACCTTCTCAAAAAGAAGAAACGGTTTGCTGAAG 193 
■ MOV* F0KRRHOLLK 30 

AAAGCTTATCAGCTTTCTGTTCTTTGCGATGCCGAAGTTGCTCTC 238 
XATBLflVLCDAIVAL 45 

ATCATCTTCTCAAGCCGTGGCAAGCTCTACGAGTTTGGTAGTGTT 2 83 

IirflBROKLTBPOSV 60 

GG AATTG AAAG C ACAATCG AACGG T AT AATCGTTGTT AC AACTGC 328 

QIHgT IBRTITRCY M C 75 

TCTCTAAGCAATAATAAGCCTGAAGAGACTACACAG AGTTGGTGT 373 
AL0K1IKPBBTTQSWC 90 

CAGGAGGTGACAAAGCTTAAATCCAAATACG AATCTCTTGTTCGT 418 
QBVTKLK8KTB8LVR 105 

ACT AAC AGGAATTTG CTTGG AG AAG A TCTTGG AG AAATGGGTGTG 463 
THRMLLOSDLGBMQV 120 

AAGGAACTGCAAGCGCTCG AGAGGCAGCTCGAAGCCGCTCTTACC 508 

KBLQALBRQLBAALT 135 

GCG ACTCG ACAGCG C AAG AC A C AAGTTA TG ATGG AAGAAA TGGAA 553 

A T R Q R KTQVHMBBMB 150 

GACCTTAGGAAAAAGGAGAGGCAACTAGGAGACATAAACAAACAA 598 
DLRBKBRQLODIMXQ 165 

CTCAAGATTAAGTTTG AAA CGG AAGG CCATGCTTTCAAAACCTTT 643 
LKIKPBTBOBAFKTF 180 

CAAGACTTATGGGCAAACTCGGCGGCATCGGTGGCCGGGGATCCA 6 B8 

0DLVAM8AASVAODP 195 

AACAATTCTG AATTTCCGGTAG AG CCTTCTC ATCCT AATG TA TTG 733 

M a & BPPVBPSHPHVL 310 

GATTGCAACACCGAACCCTTTTTACAAATAGGGTTTCAACAACAT 778 
DCNTBPFLQIGFQQH 225 
TACTACGTGCAAGGTGAAGGGTCTTCGGTATCAAAGAGTAACGTG 823 
TTVQOBOSSVflKSMV 240 
GCAGGTGAGACTAATTTCGTCCAAGGTTGGGTTCTTTGA 862 
AOBTHFVQOWVL Ind 252 
CTCTCTG TTG ATT AG CCCA CG A TGCCACGG TC AGGCC AATTTC AGC 908 
TCTCTACAGTTGTTCTTTTTTC AAATT AG A TTTCTGG TTTTTTTTT 954 
TCCTATAAGAAAAACTTTTGCACTAGATGTTTGTCATTTAATTTCC 1000 
AGCTCGTGTGAATCTATATTCGCATGTATGTGCTTTGAAGAATTTC 1046 
$ 

TCCTCTTACTCCTACTTGATCTAAAACATTATTTTTGTTTTGGGTTTAn 

$ 

Figure 4. AGL6 cDNA and deduced protein sequence. The
calculated molecular mass for AGL6 is 28,744 daltons. The
potential phosphorylation sites and glycosylation sites are
underlined. The positions of two observed polyadenylation sites are
denoted with a dollar ($) sign below the nucleotide.



ORFs are also highly conserved between AGL2 and
AGL4 (36/45 and 17/18 identity for 5' and 3',
respectively). However, the nucleotides immediately adjacent
to the ATGs of these small ORFs do not match the
plant initiation consensus sequence, (A/T)(C/A)AACAATGGC
(Lütcke et al. 1987). The presence of short
ORFs upstream of the protein-coding region has been
observed previously for other genes in yeast (Hinnebusch
1984; Werner et al. 1985; Forsburg and Guarente 1989),
in animals (Kozak 1987), and in plants (Ma et al. 1990;
Schmidt et al. 1990). In yeast, it is known that the short
ORFs in the GCN4 and CPA1 (Hinnebusch 1988) mRNAs
are required for proper translational regulation.
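
A short upstream ORF of this kind can be located by scanning a 5' leader for ATG codons and translating each one until an in-frame stop codon. The sketch below does this for an invented leader that happens to encode the heptapeptide MFLCVCI; the real AGL2 and AGL4 leaders are shown in Fig. 3 and are not reproduced here, so the sequence `toy_leader` and the function names are assumptions for illustration. The code expects input containing only A, C, G, and T.

```python
from itertools import product

# Standard genetic code, codons enumerated in TCAG order; "*" marks a stop.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def translate_from(seq, start):
    """Translate from start until an in-frame stop; None if no stop is found."""
    peptide = []
    for i in range(start, len(seq) - 2, 3):
        residue = CODON_TABLE[seq[i:i + 3]]
        if residue == "*":
            return "".join(peptide)
        peptide.append(residue)
    return None


def find_orfs(seq):
    """Return (start, peptide) for every ATG followed by an in-frame stop."""
    seq = seq.upper()
    orfs = []
    for start in range(len(seq) - 2):
        if seq[start:start + 3] == "ATG":
            peptide = translate_from(seq, start)
            if peptide is not None:
                orfs.append((start, peptide))
    return orfs


if __name__ == "__main__":
    # Hypothetical leader: a 7-codon upstream ORF, then a second, separate ORF.
    toy_leader = "GAGA" + "ATGTTTCTTTGTGTTTGTATTTAA" + "CC" + "ATGGGAAGATGA"
    for start, peptide in find_orfs(toy_leader):
        print(start, peptide)
```

In a real cDNA the long ORF downstream would be much longer than the second toy ORF here; the point is only that the scan reports the short upstream ORF separately from the main coding region.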



Map positions of AGL1, AGL2, and AGL3 

As a step toward determining whether the AGLs
correspond to any genes identified previously by genetic
analysis, we localized AGL1, AGL2, and AGL3 relative to
other molecular markers using restriction fragment
length polymorphisms (RFLPs; Chang et al. 1988). AGL1
maps on chromosome 3 near the lower end, ~0.6 cM
from the marker 460 on the RFLP map constructed by
Chang et al. (1988); AGL2 maps on chromosome 5 about
0.6 cM centromere-proximal from the chalcone synthase
gene; and AGL3 maps on chromosome 2 near the upper
end, ~2.6 cM from the marker 246. From these mapping
results, AGL1-3 do not appear to coincide with any gene
identified previously by mutations. The AGL4, AGL5,
and AGL6 genes did not reveal any RFLPs between the
ecotypes used in our crosses, and have not been mapped.



AGL1, AGL2, AGL4, AGL5, and AGL6 are expressed
preferentially in flowers

Because cDNA clones for all six AGL genes have been
isolated from a cDNA library constructed from floral
poly(A)+ RNA, it follows that these genes are expressed
in flowers. Their expression patterns were further
characterized using RNA dot blot hybridizations. RNAs from
immature seed pods, flowers, stems, and leaves were
spotted onto nylon filters, and identical filters were
probed with each of the labeled 3' portions of AGL1,
AGL2, AGL4, AGL5, and AGL6 cDNAs (lacking the
sequences encoding the MADS box to minimize
cross-hybridization) and the only available AGL3 cDNA
(including the sequences encoding the MADS box). As a control,
radiolabeled AG cDNA was also used to probe one of the
RNA filters. The results (Fig. 6) agree with the previous
finding (Yanofsky et al. 1990) that AG is expressed in
flowers but not in leaves or stems. Five of the AGL genes
(all except AGL3) are expressed preferentially in flowers,
and the expression continues, albeit diminished, in
immature seed pods (Fig. 6). Faint signals were also detected
with AGL1 and AGL5 in stems, and with AGL2 in
leaves. AGL3 is expressed in stems and leaves, as well as
in flowers and seed pods (Fig. 6). As controls for
cross-hybridization, in vitro transcripts of the sense
orientation from AGL1, AGL2, AGL4, AGL5, and AGL6 cDNAs
were synthesized and spotted on the same filter strips.
No cross-hybridization between any of the AG and the
AGL probes and in vitro transcripts was observed under
the conditions used (Fig. 6). The approximate levels of
AGL expression are slightly lower than that of AG,
which was estimated to have an average abundance of 1
in 10* poly(A)+ RNA molecules in floral tissues
(Yanofsky et al. 1990). This result agrees with the observed
frequency of AGL cDNA clones in the cDNA library.



In situ RNA hybridizations with AGL1 and AGL2 

To determine whether these genes are expressed in an 
organ-specific manner, the expression patterns of AGL1 
and AGL2 were characterized in more detail by in situ 
hybridization. Wild-type A. thaliana (Landsberg erecta)
inflorescence sections were hybridized with 35S-labeled
antisense RNA probes from AGL1 and AGL2 cDNAs.

Figure 5. The comparison of deduced AG and AGL structures and
alignment of conserved domains. The asterisks (*) represent
identity to the first sequence of the same group; dashes indicate
gaps introduced for alignment purposes. The alignment was done
using the FASTP program (Lipman and Pearson 1985). (A) The
comparison of different regions of AG and the AGL proteins. The
four regions (MADS, I, K, and C; see Table 1 for more detailed
data on percent identities) are represented by differently shaded
boxes. The numbers at the top are percent identities between AG,
AGL1, and AGL5 (subfamily I), the numbers at the bottom are those
between AGL2, AGL4, AGL6, and AGL3 (subfamily II), and the numbers
in the middle are those between any member of subfamily I and any
of subfamily II. The AGL3 information is partly derived from
genomic sequence based on similarity to other AGLs and canonical
intron donor and acceptor sites; the dashed-line boxes represent
presumed unknown regions. The sequences in the MADS and K boxes
are compared in B and C, respectively. (B) The alignment of the AG
(Yanofsky et al. 1990) and AGL MADS boxes with those of DEF A
(Sommer et al. 1990), SRF (Norman et al. 1988), MCM1 (Passmore et
al. 1988), and ARG80 (Dubois et al. 1987) proteins. The conserved
phosphorylation site (1) and the peptide (2) used to derive
degenerate oligonucleotide sequences are indicated with lines
above the AG sequence. (C) The alignment of the plant protein K
boxes and a region of the human type II keratin (Krt; see Tyner et
al. 1985). Two possible helices are indicated by lines above the
regions, and the circles (O) above the AG sequence indicate
residues at positions a and d in the coiled coil heptapeptide
repeat structure (Steinert and Roop 1988). The plus (+) signs
below the keratin sequence represent an identity of the keratin
residue to the corresponding residue in at least three of the AG,
AGL, and DEF A sequences; the number (#) signs represent similar
residues between keratin and at least four of the AG and AGL
sequences.

The 3' portions of the cDNAs lacking the putative DNA- 
binding domain were used to avoid cross-hybridization. 
As shown in Figure 7, AGL1 is expressed in carpels, par- 
ticularly in ovules but not in stamens, petals, or sepals. 
AGL2 is expressed mainly in carpels; in addition, the 
AGL2 probe also detects a weak signal in stamens. 
Within the stamens, the AGL2 signal is restricted to the 
anthers and is not observed in the filaments. Similar to 
AGL1, the AGL2 signal in carpels is concentrated in 
ovules. The expression of AGL1 and AGL2, as detected 
by in situ hybridization, begins in stage 10 flowers 
(Smyth et al. 1990) after all of the floral organs are rec- 
ognizable and the ovules are visible. This onset of AGL1 
and AGL2 expression is much later than that of AG, 
which is seen before the separation of petal and stamen 
primordia from the central floral primordium (G. Drews, 
J. Bowman, and E.M. Meyerowitz, unpubl.), in stage 3 
flowers (Smyth et al. 1990). The expression of AG,
AGL1, and AGL2 extends into later stages of flower
development, including immature seed pods (Figs. 6 and
7). For AGL1 and AGL2, the in situ signals are stronger
in older organs (Fig. 7).



Discussion 

AG and the AGLs constitute a gene family 

We have identified and characterized six genes from A. 
thaliana, designated AGL1-AGL6. The deduced AGL 
proteins all share striking sequence similarity (Fig. 5)
with each other and with the products of the floral ho- 
meotic genes AG and DEF A. Sequence analysis indi- 
cates that AG and AGLs are members of a diverse gene 
family. Table 1 shows the percentage of amino acid se- 
quence identity between these deduced proteins in four 
regions. The most conserved region, the MADS box (M), 
is located either at or very near the amino terminus in 
the AGLs. The second conserved domain (the K box; see
below), not found in SRF and MCM1, is near the center
of the proteins, ~35 residues from the MADS box. On
the basis of sequence comparison, AG, AGL1, and AGL5
can be assigned to one subfamily, and AGL2, AGL4, and 
AGL6 can be assigned to another subfamily. It is worth 
noting that the sequence similarity shared between 
members of the same subfamily is not restricted to the 
two conserved regions but extends throughout the entire 
length of the proteins (Fig. 5; Table 1). The subfamily
assignment is also supported by the exon-intron struc- 



GENES & DEVELOPMENT 489 







Ma et al.

Table 1. Percentage of amino acid identity in different domains of AG, the AGLs, and DEF A proteins

Genes    AG        AGL1      AGL5      AGL2      AGL4      AGL3      AGL6      DEF A
         I    C    M    K    M    K    M    K    M    K    M    K    M    K    M    K
AG                 95   68   95   62   82   32   84   30   80   37   84   38   71   24
AGL1     71   39             100  92   82   33   80   33   82   29   82   33   68   20
AGL5     65   37   95   72             82   36   80   35   82   33   82   33   68   23
AGL2     14   14   11   11   11   10             98   97   95   59   95   53   62   29
AGL4     17   14   14   10   14   9    94   70             93   57   93   55   62   29
AGL3     14        14        14        42        39                  89   51   61   24
AGL6     23   10   23   12   23   11   34   19   29   19   17                  57   23
         I    C    I    C    I    C    I    C    I    C    I    C

The percentages for the MADS box (M, shown in boldface, 56 residues) and the second conserved domain (K, 66 residues except for
AGL3, of which only 49 residues are known) are shown above the diagonal (blank space); the percentages for the sequences between
the two conserved domains (I, 34-36 residues) and the carboxy-terminal regions (C, 78-98 residues) are shown below the diagonal.
Because the AGL3 carboxy-terminal sequence is not known, the percentage of identity could not be calculated. See Fig. 5A for the
domain organization of AG and the AGLs. The percentages of the two conserved domains (M and K) for Antirrhinum protein DEF A
are also shown.



tures. An analysis of third-base silent changes in the 
MADS box indicates that the two most similar pairs 
(AGL1 and AGL5; AGL2 and AGL4) have much smaller
percentages of difference (<30%) than other pairs,
supporting the subfamily structure. Although the
incomplete AGL3 sequence does not allow definitive
assignment of the AGL3 gene, the partial sequence data
indicate it is more similar to the AGL2, AGL4, and AGL6
subfamily than the AG, AGL1, and AGL5 subfamily.
Figure 8 illustrates the relationship between AGLs and AG
based on the sequence information. Sequence compari- 
son of the DEF A gene to AG and the AGL genes suggests 
that DEF A does not belong to either of the two subfam- 
ilies (Table 1; Fig. 8). Low-stringency hybridization of 
Arabidopsis genomic DNA with degenerate oligonucleotides
revealed ~20 bands, more than accounted for by
the seven genes that have been isolated; therefore, it is 
likely that there are additional members of this gene 
family in Arabidopsis. We propose that members of this 
gene family are derived from a single ancestral gene and 
have arisen by gene duplication and subsequent se- 
quence divergence and intron loss (Fig. 8). 
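
The percent identities used above to group the genes into subfamilies can be computed from any pairwise alignment with a short calculation. The following is a minimal sketch in Python and not the procedure used to produce Table 1: the function name `percent_identity`, the decision to skip columns containing a gap, and the two toy sequences are all assumptions made for illustration.

```python
def percent_identity(seq_a: str, seq_b: str, gap: str = "-") -> float:
    """Percent identity over aligned columns in which neither sequence has a gap.

    Assumption: gapped columns are excluded from the denominator. Other
    conventions exist, and the paper does not say which one it used.
    """
    if len(seq_a) != len(seq_b):
        raise ValueError("aligned sequences must have equal length")
    compared = [(a, b) for a, b in zip(seq_a, seq_b) if a != gap and b != gap]
    if not compared:
        raise ValueError("no ungapped columns to compare")
    matches = sum(a == b for a, b in compared)
    return 100.0 * matches / len(compared)


# Hypothetical aligned fragments (not AG or AGL sequences).
toy_a = "ACDEFGHIK-LMN"
toy_b = "ACDQFGHVKWLMN"

# 12 ungapped columns, 10 of them identical.
print(f"{percent_identity(toy_a, toy_b):.1f}% identity")
```

Applied domain by domain (M, I, K, and C) to each pair of aligned proteins, a function of this kind would yield a half-matrix laid out like Table 1.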

AGLs likely encode transcription factors 

The deduced AG and AGL amino acid sequences, as well 
as the product of the snapdragon floral homeotic gene 
DEFA (Sommer et al. 1990), contain a sequence motif of 
~56 amino acids (the MADS box; see Fig. 5B) that is also
found in the transcription factors SRF (Norman et al.
1988) and MCM1 (Passmore et al. 1988). A region of ~90
residues containing the MADS box is known to be
sufficient for DNA binding and is involved in dimerization
of SRF (Hayes et al. 1987; Norman et al. 1988). Recent
evidence (Tan and Richmond 1990) suggests that the
yeast MCM1 MADS box is also sufficient for specific
DNA binding. The human SRF is involved in the
regulation of the proto-oncogene c-fos (Treisman 1986, 1987)
and a sarcomeric actin gene (Boxer et al. 1989). The yeast
MCM1 gene product (GRM/PRTF) regulates mating
type-specific gene expression (Herskowitz 1990). The
phenotypes of Arabidopsis ag mutants (Bowman et al.
1989) and Antirrhinum defA mutants (Sommer et al.
1990) suggest that these genes play regulatory roles in
specifying the identity of floral organs. The fact that the 
AGL proteins also contain the MADS box suggests that 
they are also transcription factors, possibly regulating 
floral morphogenesis. The AGL proteins may function in 
different floral cells, controlling branches of the floral 
morphogenesis regulatory hierarchy. Alternatively, they 
may function at different times, directing different 
stages of flower development. Although all of the AGL 
proteins are probably transcription factors, their expres- 
sion patterns (see below) suggest they control different 
sets of genes. 

Figure 6. RNA dot blot autoradiogram. RNAs from immature
seed pods (P), floral buds (F), stems (S), and leaves (L), as well as
in vitro-synthesized RNAs from AGL1 (1), AGL2 (2), AGL4 (4),
AGL5 (5), and AGL6 (6), were spotted onto eight nylon filter
strips. Each strip was hybridized with one 32P-labeled cDNA:
AG, AGL1 (1), AGL2 (2), AGL4 (4), AGL5 (5), AGL6 (6), AGL3
(3), and GPA1. GPA1 is expressed in both floral and vegetative
tissues (Ma et al. 1990). Similar amounts of radioactivity were
used in all of the hybridizations. A single autoradiographic
exposure was used.

Figure 7. RNA in situ analysis with AGL1 (A-F) and AGL2 (G-L). A-C and G-I were photographed in bright field, and the others in
dark field. For each gene, at least three developmental stages (for a description of stages 1-12, see Smyth et al. 1990) are shown: (A and
D) stages 5-6; (G and J) stage 9; (B and E) stage 12; (H and K) stage 10; (C, F, I, and L) after flower opens but just before pollination,
stage 13. Floral organ designations: (S) sepal; (P) petal; (T) stamen; (A) anther; (F) filament; (C) carpel; (O) ovule. The grains seen on the
surface of some sepals and around the pollen grains are also seen when the sense-strand probes were used and, therefore, probably
represent nonspecific sticking of the probes. Prints of one enlargement were used for A-F, I, and L, and a different enlargement for G,
H, J, and K. (A and G) Bars, 100 µm.

In addition to the MADS box, the AG and AGL proteins
share a second domain, which has a low but significant
similarity to a portion of keratin sequences (the K
box; Fig. 5C). Keratins are major components of
intermediate filaments (Steinert and Roop 1988). It is known
that the region of keratin with similarity to AG and AGL
proteins is part of the coiled coil sequence that forms the 
central rod-shaped domain of keratin (Fig. 5C). The AGL 
sequences in this domain can potentially form two am- 
phipathic helices. 

Phosphorylation and glycosylation may modulate the
activity of AGLs 

SRF (Prywes et al. 1988; Ryan et al. 1989) is known to be 
phosphorylated. Furthermore, the phosphorylation of 
SRF has been shown to affect its activity (Prywes et al. 
1988). Other transcription factors have been suggested to 
be regulated by phosphorylation, such as the yeast heat 
shock factor (Sorger and Pelham 1988) and GAL4 protein 
(Mylin et al. 1989). The difference between the apparent 
size (Tan and Richmond 1990) and sequence-derived size 
(Passmore et al. 1988) of the MCM1 protein suggests that
it is also post-translationally modified. There is a
conserved potential site (RQVT; Fig. 5C) for
calmodulin-dependent phosphorylation [RXX(S/T); see Cohen 1988]
in all of the AGLs, and most have additional sites as well 
(Figs. 2-4). Furthermore, it was reported that SRF is a 
glycoprotein (Schroter et al. 1990). Several other eukary- 
otic transcription factors are also glycosylated (Jackson 
and Tjian 1988; Lichtsteiner and Schibler 1989). The 
AGL proteins all have potential glycosylation sites (NXT 
or NXS; see Fishleigh et al. 1987; Figs. 2-4). The presence
of these sites suggests that the activity of AGLs
may be modulated by phosphorylation and/or glycosyla- 
tion, perhaps in response to environmental and develop- 
mental signals. 
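
The two consensus patterns named above, RXX(S/T) for phosphorylation and NXT or NXS for glycosylation, are simple enough to locate in a protein sequence with regular expressions. The sketch below is one possible way to do this in Python and is not the method used to annotate Figures 2-4: the helper `find_sites`, the pattern names, and the toy peptide are assumptions made for illustration. A match marks a potential site only and says nothing about whether the residue is modified in vivo.

```python
import re

# Lookahead groups let overlapping motifs be reported as separate hits.
PHOSPHO_SITE = re.compile(r"(?=(R..[ST]))")  # RXX(S/T), as given in the text
GLYCO_SITE = re.compile(r"(?=(N.[ST]))")     # NXT or NXS, as given in the text


def find_sites(protein: str, pattern: re.Pattern) -> list[tuple[int, str]]:
    """Return (1-based start position, matched motif) for every hit."""
    return [(m.start() + 1, m.group(1)) for m in pattern.finditer(protein)]


# Hypothetical peptide, not taken from any AG or AGL sequence.
toy_peptide = "MGRQVTFSKRNGSLENKTA"

print("phosphorylation:", find_sites(toy_peptide, PHOSPHO_SITE))
print("glycosylation:  ", find_sites(toy_peptide, GLYCO_SITE))
```

For the toy peptide this reports RQVT at position 3 and RNGS at position 10 as potential phosphorylation sites, and NGS at position 11 and NKT at position 16 as potential glycosylation sites.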

Expression and functional implications of AGLs 

Figure 8. A proposed relationship between AG, AGL genes,
and DEF A. The number of introns is indicated in parentheses
below each gene where it is known. The dashed line leading to
AGL3 represents the uncertainty of the position of AGL3 due to
the incompleteness of its sequence information. The time at
which introns were lost during the evolution of AGL2,
AGL4, and AGL6 cannot be deduced because intron information
is not available for AGL3 and AGL6.

Five of the AGLs are expressed preferentially in flowers
and young seed pods but not (or at low levels) in leaves or
stems. At this level, they are similar to known floral
homeotic genes, AG from A. thaliana and DEF A from A.
majus. The in situ hybridization results, on the other 
hand, indicate that the patterns of AGL1 and AGL2 ex- 
pression within the flower are slightly different from 
that of AG. The AG expression begins in morphologi- 
cally undifferentiated cells (G. Drews, J. Bowman, and 
E.M. Meyerowitz, unpubl.) in early developing flowers at 
stage 3 (Smyth et al. 1990). Later in development, AG is
expressed in both carpels and stamens, including anthers
and filaments (G. Drews, J. Bowman, and E.M. Meyerowitz,
unpubl.). The onset of AGL1 and AGL2 expression is
at stage 10, much later than that of AG. In addition, 
AGL1 is expressed preferentially in carpels, not in sta- 
mens, petals, or sepals. The AGL2 signal is found prima- 
rily in carpels and is lower in stamens. In carpels, the 
expression of both AGL1 and AGL2 is concentrated in 
ovules. Although AGL2 and AG are both expressed in 
the stamens, AGL2 mRNA is found only in the anthers, 
not in the filaments. These expression patterns suggest 
that AGL1 and AGL2 may regulate different genes from 
those that are controlled by AG. The facts that AGL1 and
AGL2 share amino acid similarity with transcription
factors and that they are expressed in ovules suggest that
they are both involved in regulating ovule development.
The flower-specific expression of the AG, AGL, and DEF 
A genes suggests that the MADS box has been used re- 
peatedly for flower development. On the other hand, be- 
cause AGL3 RNA is expressed in vegetative tissues as 
well as floral tissues, it is likely that this class of tran- 
scription factors is not exclusive to flower development. 

Flowering plants appeared suddenly and very recently 
in evolution at ~140 million years ago (Lower
Cretaceous). Some have suggested that floral organs are
modified leaves, through a series of structural and functional
changes during evolution (Stebbins 1976). In fact, the 
combination of mutations in three Arabidopsis
homeotic genes, AG, APETALA2, and APETALA3, leads to
"flowers" consisting of only leaf-like organs (J. Bowman,
D.R. Smyth, and E.M. Meyerowitz, in press). The
presence of MADS-box-containing genes in yeast, plants, and
humans argues that this class of genes predates flowering
plants. Therefore, it is likely that a MADS-box regulatory
gene or genes was present and functioning when flowers
arose. Furthermore, during the evolution of flower struc- 
tures, it is possible that these genes were duplicated, and 
some members diverged to take on new functions in flo- 
ral morphogenesis, either interacting with different pro- 
teins, or binding to DNA with slightly different specific- 
ities. The family of genes described here includes at least 
one member (AGL3) that is expressed in both vegetative
and floral tissues, presumably fulfilling a more
widespread function. Additional members (AG, AGL1, AGL2,
AGL4, AGL5, AGL6, and possibly others) of this gene
family have presumably arisen by gene duplications and
have evolved to be preferentially expressed in flowers. At
least two members of this gene family (AG from
Arabidopsis and DEF A from Antirrhinum) are known to
control flower development based on genetic analyses
(Bowman et al. 1989; Sommer et al. 1990). Future analyses are
required to test our hypotheses about the function of the 
other MADS box genes. 



Materials and methods 

Library screening, clones, and subclones 

A genomic cosmid library (Yanofsky et al. 1990) made from 
nuclear DNA of A. thaliana (Landsberg erecta) was screened
with radiolabeled degenerate oligonucleotides according to
previously published procedures (Burglin et al. 1989). The
oligonucleotides 5'-ACNGANAGYTCUTANGCYTTYTT-3' (N = A,
G, C, or T; U = A or G; Y = C or T) are based on the conserved
heptapeptide KKAYELSV in the MADS box (Yanofsky et al.
1990). More than 70 positives were detected among colonies of
four genomes' worth, and cosmid DNA from 46 of the positives
was purified. Additional screening of the cosmid library with
AGL2 and AGL5 cDNAs was done as described previously for
hybridization of genomic DNA (Chang et al. 1988).
Representative cosmids were characterized further: AGL1, pCIT1202
and pCIT1210; AGL2, pCIT1243; AGL3, pCIT1216; AGL4,
pCIT1247 and pCIT4244; and AGL5, pCIT4243. AGL6 cosmids
have not yet been isolated. Portions of the cosmids pCIT1202,
1243, 4244, and 4243 are shown in Figure 1B.
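
The relationship between the degenerate oligonucleotide and the conserved peptide can be checked with a few lines of code. The following Python sketch is an illustration only and was not part of the original procedure: the function names are hypothetical, the standard genetic code is assumed, and the symbol U is read as "A or G" exactly as defined in the text (this corresponds to R in IUPAC notation, not to uracil). The oligonucleotide is the antisense strand, so it is reverse-complemented before translation.

```python
from itertools import product

# Bases allowed at each symbol. "U" follows the definition in the text (A or G).
DEGENERATE = {"A": "A", "C": "C", "G": "G", "T": "T",
              "N": "ACGT", "Y": "CT", "R": "AG", "U": "AG"}
# Complement of each symbol: a purine mix (R or U) pairs with a pyrimidine mix (Y).
COMPLEMENT = {"A": "T", "C": "G", "G": "C", "T": "A",
              "N": "N", "Y": "R", "R": "Y", "U": "Y"}

# Standard genetic code, codons enumerated in TCAG order.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def degeneracy(oligo: str) -> int:
    """Number of distinct sequences in the degenerate pool."""
    count = 1
    for symbol in oligo:
        count *= len(DEGENERATE[symbol])
    return count


def reverse_complement(oligo: str) -> str:
    return "".join(COMPLEMENT[s] for s in reversed(oligo))


def translate_degenerate(seq: str) -> str:
    """Translate complete codons; 'X' marks a codon that is ambiguous."""
    peptide = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        options = product(*(DEGENERATE[s] for s in seq[i:i + 3]))
        residues = {CODON_TABLE["".join(codon)] for codon in options}
        peptide.append(residues.pop() if len(residues) == 1 else "X")
    return "".join(peptide)


oligo = "ACNGANAGYTCUTANGCYTTYTT"
coding = reverse_complement(oligo)
print(degeneracy(oligo), coding, translate_degenerate(coding))
# prints: 1024 AARAARGCNTAYGARCTNTCNGT KKAYELS
```

Under these assumptions the pool contains 1024 sequences. The seven complete codons of the coding strand specify KKAYELS, and the two remaining bases (GT) are the first two positions of a valine codon, which matches the final residue of KKAYELSV.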

About 1 × 10^6 plaques of a cDNA library constructed from
floral bud poly(A)+ RNA (Yanofsky et al. 1990) were screened at
a moderate stringency (as described previously by Chang et al.
1988, except that the hybridization and washes were done with
5× SSPE at 52°C) with a 977-bp AG cDNA EcoRI fragment
(pCIT565) as a probe. The cDNA library was also screened at
high stringency (65°C hybridization and a final wash with 0.2×
SSPE) with several probes (Fig. 1B): probe 1, a U-kb DraI AGL1
genomic fragment (from pCIT1202); probe 2, a 0.36-kb BglII
AGL2 genomic fragment (from pCIT1243). During the
construction of the cDNA library, cDNAs with internal EcoRI site(s)
were cleaved and then ligated into separate vector molecules.
The AGL1, AGL2, AGL3, AGL4, and AGL5 cDNAs all contain
at least one EcoRI site; therefore, each of the cDNAs had to be
isolated as two or more separate fragments. The 3' portions of
AGL3 and AGL5 cDNAs have not yet been isolated. The
portions encoding the amino terminus containing the conserved
DNA-binding domain (Fig. 1A) were isolated first. The portions
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^ AGW < Fl * » A) were isolated subsequently using the 
respective gene-specific genomic probes (probes 3-S FiTlBI 
probe 3 iACLl), a 0.41-Jcb EcoU-Bgla and 

< fr ° m P < 2J 1 ! > ^ ,i " robe 4 WCUl * 2.0-kb «X faj 
ment (from pCIT1243); and probe S (ACL4|, a 1.0-kb EcoW 
B g m fragment (from pCIT4244,. The ACLlAGll tand 5Sj 
genomic sequences were determined, and each agrees with the 
corresponding cDNA sequences on both sides of the £oW sites 
Furthermore, additional AGL6 cDNA clones were isolated 
^ the nrst cDNA clone <pCm209) as a probe . Th^owu^ 
m ^ KgUre 1A: ^ Gli - PCm241(5'| and 42W 

L A^ G ^P CIT3228 ,5 '> and 4221 < 3 ')' pcrmso (S'l 

oCnW PfT 2299 »«*mg the MADS box] is a subclone of 
nC^n , con , talnln « a «gion 3' of the Seal site (Fig. 1A). 
a J?1 2 ™ < lackjn S *e MADS box) contains a port on of an 

cDNA 3' of the Xhol site (Fig. 1 A); and P CrT42H is 
same as pClT4233 except at the 3' end where pCIT4214 lack S 
nucleotides and the poly(A| tail. "«ui4iac*23 

DNA sequencing and RFLP analysis 

S^fm^; ? d /^. NA were subcloned into 

pGEM3Zf(+) and pGEM7Zf(+) vectors (Promega) for
sequencing. Sequencing was done using the Sequenase kit (U.S.
Biochemical) according to the provided protocol. Both strands were
sequenced, unless otherwise noted.

for AGL1, a 1.8-kb Bgffi fragment from cosmid clone PCIT1210 

E\Sd"# H rCVeal 3 ^ P° , ^°'Phism between the Shim 
bia and Niederzenz ecotypes of Arabidopsis, and it was usedTo 
probe falters carrying DMAs from one of the crosses ° 

RFLP f 3P lCbaa * « aL 198 «) SinuUrly, a M3 kb 
fcoRI fragment from AGL2 cosmid pCIT1243 revealed .n 
&oRI polymorphism between Columbia Inl Sel^ 

%?T n d w Wa ? USCd l ° probe ^ the same^T 

Addrtiona hybridization of fUters with DNAs froma^os^ be- 
tween Columbia and Niederzenz [Xbal r^lymorpWsmUnd a 

t-oang et al. 1988) were performed using an AGL2 Bgffi—EcoRl 
fragment (subclone P CIT1273 from cosmid P Crn243| as a 
probe. For AGI3 a subclone (pCm29I)with a Jfi fr« 

S w^CoTuTh '^"uncovered « *«» PolvmorK, 
between Columbia and Niederzenz ecotypes and a nolv 
morphism between Landsberg and Niede^enz ewXtrf 
was used to probe appropriate filters. The data from DNA blot
hybridization experiments were analyzed using the
MAPMAKER computer program (Lander et al. 1987) to obtain
linkage information with respect to existing markers on the RFLP
map (Chang et al. 1988).

RNA analyses 

PolyfA) ♦ RNA was isolated from developing seed pods 13-5 da v* 
^er pollination, floral buds (stages 1-Vsm^ « il S 
floral stems, and leaves, according to procedures described^' 

MA^ZIT WM Pr0bed "«W«NA of X 
n gene wh,ch is expressed in stems and leaves as weil« 
in flowers (Ma et al. 1990). The following plasmidsTsee fT b f 
unless otherwise noted, the entire mserttL usedTw^re !sed to 
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probe synthesis:pCIT5«5 [AG, with putative DNA binding do- 
mam , pCIT4219 [AGL1], pCIT4221 \AGL2, 3" Ndel-tooUt \Z 
^ P ^ 4233 lAGU > S **-W fragment! POT2299 

£™ L PC ^ 421 ° lAGL6 > 3 ' »**»W fragmen ) pOT22M 
IACI3) and pClT857 IGPA1, see Ma et JHwi'io?™ 
cross-hybndization, the probes for AGL1, AGL2 AGlZagu, 
correspond to carboxy-terminal portion of the protets 
conserved and 3' nontranslated regions. HybriaCtion^ ^were 

AC mRNA present 111 the total flower RMai ^ ! 
sized RNA from AC£ cDN^S, tt 2£5 
on the same filters. The following cDNAs see F lg i aTwS 
used to synthesize RNA with the resoective 
PCTT4219 [AGL1, T7|, pClT422 1 SMI^cr^ 
«GU T7, pCrT32,6 W G15, SP6) ^CK^ag™ n 
The plasmids were linearized so that only the inserts were used
as templates for RNA synthesis.

In situ analysis was performed according to a previously
described procedure (Barker et al. 1988; G. Drews, J. Bowman, and
E.M. Meyerowitz, unpubl.). Inflorescences nilhvoZT'hZt 
£r l -, l , 2i B0Wman " 1989 '- ^Vth et al. iSoCToweJ 
before polJmauon were fixed, embedded, and sectioned K e 
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